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(57) Abstract 



CRSP (Cysteine-Rich Secreted Proteins) polypeptides, proteins, and nucleic acid molecules are disclosed. In addition to isolated, 
full-length CRSP proteins, the invention further provides isolated CRSP fusion proteins, antigenic peptides and anti-CRSP antibodies. The 
invention also provides CRSP nucleic acid molecules, recombinant expression vectors containing a nucleic acid molecule of the invention, 
host cells into which the expression vectors have been introduced and non-human transgenic animals in which a CRSP gene has been 
introduced or disrupted. Diagnostic, screening and therapeutic methods utilizing compositions of the invention are also provided. 



AL 

AM 

AT 

AU 

AZ 

BA 

BB 

BE 

BF 

BG 

BJ 

BR 

BY 

CA 

CF 

CG 

CH 

CI 

CM 

CN 

cu 
cz 

DE 
DK 
EE 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States pa rty to the PCT on the front pages of pamphlets publishing intentional applications under the PCT 



Albania 

Armenia 

Austria 

Australia 

Azerbaijan 

Bosnia and Herzegovina 

Barbados 

Belgium 

Burkina Faso 

Bulgaria 

Benin 

Brazil 

Belarus 

Canada 

Centra! African Republic 

Congo 

Switzerland 

Cote crivoire 

Cameroon 

China 

Cuba 

Czech Republic 
Germany 
Denmark 
Estonia 



F! 
FR 
GA 
GB 
GE 
GH 
GN 
GR 
HU 

IE 

IL 

IS 

IT 

JP 

KE 

KG 

KP 

KR 

KZ 

LC 

LI 

LK 

LR 



Spain 
Finland 
France 
Gabon 

United Kingdom 

Georgia 

Ghana 

Guinea 

Greece 

Hungary 

Ireland 

Israel 

Iceland 

Italy 
Japan 

Kenya 

Kyrgyzstan 

Democratic People's 

Republic of Korea 

Republic of Korea 

Kazakstan 

Saint Lucia 

Liechtenstein 

Sri Lanka 

Liberia 



LS 


Lesotho 


LT 


Lithuania 


LU 


Luxembourg 


LV 


Latvia 


MC 


Monaco 


MD 


Republic of Moldova 


MG 


Madagascar 


MK 


The former Yugoslav 




Republic of Macedonia 


ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 


MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 



FT 
RO 
RU 
SD 

SE 
SG 



Portugal 
Romania 

Russian Federation 

Sudan 

Sweden 

Singapore 



SI Slovenia 

SK Slovakia 

SN Senegal 

SZ Swaziland 

TD Chad 

TG Togo 

TJ Tajikistan 

TM Turkmenistan 

TR Turkey 

TT Trinidad and Tobago 

UA Ukraine 

UG Uganda 

US United States of America 

UZ Uzbekistan 

VN Viet Nam 

VU Yugoslavia 

ZW Zimbabwe 



WO 98/46755 PCT/US98/07894 



uses thSor CYSTEINE_RICH secreted proteins >- nucleic acid molecules encoding them and 

Background of the Invention 

5 Secreted proteins play an integral role in the formation, differentiation, and 

maintenance of cells in multicellular organisms. For instance, secretory proteins are 
known in the art to be involved in signalling between cells which are not in direct 
contact. Such secreted signaling molecules are particularly important in the 
development of vertebrate tissue during embryogenesis as well as in the maintenance of 
1 0 the differentiated state of adult tissues. For example, inductive interactions that occur 
between neighboring cell layers and tissues in the developing embryo are largely 
dependent on the existence and regulation of secreted signaling molecules. In inductive 
interactions, biochemical signals secreted by one cell population influence the 
developmental fate of a second cell population, typically by altering the fate of the 
1 5 second cell population. For example, the Wnt proteins are now recognized as one of the 
major families of developmentally important signaling molecules in organisms ranging 
from Drosophila to mice. For review see Cadigan, K.M. and R. Nusse (1997) Genes & 
Development, 11:3286-3305. 

It is also now well recognized that major families of signaling molecules have a 
20 dual role to play in both the development of an organism as well as in promoting or 
maintaining the differentiated state of tissues in the adult animal. Furthermore, major 
families of signaling molecules have been implicated in controlling proliferation of cells 
in mature adult tissue, for example, during normal cell turnover in the adult organism as 
well as in tissue regeneration activated as a result of damage to the adult tissue 

25 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of nucleic acid 
molecules which encode a novel family of secreted proteins, referred to herein as the 
Cysteine-Rich Secreted Proteins ("CRSPs" or "CRSP proteins"). The CRSP molecules 

30 of the present invention are useful as modulating agents in regulating a variety of 

cellular processes. Accordingly, in one aspect, this invention provides isolated nucleic 
acid molecules encoding CRSP proteins or biologically active portions thereof, as well 
as nucleic acid fragments suitable as primers or hybridization probes for the detection of 
CRSP-encoding nucleic acids. 

35 In one embodiment, a CRSP nucleic acid molecule is 60% homologous to the 

nucleotide sequence shown in SEQ ID NO: 1, SEQ ID NO:3, or complement thereof. In 
yet another embodiment, a CRSP nucleic acid molecule is 80% homologous to the 
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nucleotide sequence shown in SEQ ID NO:4, SEQ ID NO:6, the nucleotide sequence of 
the DNA insert of the plasmid deposited with ATCC as Accession Number 98634, or a 
complement thereof. In yet another embodiment, a CRSP nucleic acid molecule is 60% 
homologous to the nucleotide sequence shown in SEQ ID NO:7, SEQ ID NO:9, the 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
v Accession Number 98633, the nucleotide sequence of the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98452, or a complement thereof. In yet 
another embodiment, a CRSP nucleic acid molecule is 85% homologous to the 
nucleotide sequence shown in SEQ ID NO:7, SEQ ID NO:9, the nucleotide sequence of 
the DNA insert of the plasmid deposited with ATCC as Accession Number 98633, the 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98452, or a complement thereof. In yet another embodiment, a 
CRSP nucleic acid molecule is 70% homologous to the nucleotide sequence shown in 
SEQ ID NO: 10, SEQ ID NO: 12, or a complement thereof. In a preferred embodiment, 
an isolated CRSP nucleic acid molecule has the nucleotide sequence shown SEQ ID 
NO:3, or a complement thereof. In another embodiment, a CRSP nucleic acid molecule 
further comprises nucleotides 1 -37 of SEQ ID NO: 1 . In yet another preferred 
embodiment, a CRSP nucleic acid molecule further comprises nucleotides 1088-2479 of 
SEQ ID NO: 1 . In another preferred embodiment, an isolated CRSP nucleic acid 
molecule has the nucleotide sequence shown in SEQ ID NO: 1 . 

In another preferred embodiment, an isolated CRSP nucleic acid molecule has 
the nucleotide sequence shown SEQ ID NO:6, or a complement thereof. In another 
embodiment, a CRSP nucleic acid molecule further comprises nucleotides 1-125 of SEQ 
ID NO:4. In yet another preferred embodiment, a CRSP nucleic acid molecule further 
comprises nucleotides 797-848 of SEQ ID NO:4. In another preferred embodiment, an 
isolated CRSP nucleic acid molecule has the nucleotide sequence shown in SEQ ID 
NO:4. 

In another preferred embodiment, an isolated CRSP nucleic acid molecule has 
the nucleotide sequence shown SEQ ID NO:9, the nucleotide sequence of the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98633, the nucleotide 
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number 
98452, or a complement thereof. In another embodiment, a CRSP nucleic acid molecule 
further comprises nucleotides 1-92 of SEQ ID NO:7. In yet another preferred 
embodiment, a CRSP nucleic acid molecule further comprises nucleotides 891-1529 of 
SEQ ID NO:7. In another preferred embodiment, an isolated CRSP nucleic acid 
molecule has the nucleotide sequence shown in SEQ ID NO:7. 
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CRSP „„c toc «a nio , ecu|e J" W anodrer prefer, embo()i 

- —» add S e, U e„ ce . Ieas , 6 „„ *"* encoding a P ro tei „ ^ 

nucleoude encod * «SP nucIe , c acid m v 

Prote '° " a *« - amino acid s«,„en ""'^ ■»*, a 

*, U en« of SEQ ,D N0:8 . ,„ ya[ ^d, t / """""^ '° "» -*» acid 

20 ~ " nuciaoddc ^Ictnco! e '" b0diraem ' ' « SP — acid 

•".«« a, least 65 % bom„,„ " an an,in„ acid 

~^aC RS P prote :^ n °^^~le ra ,.ofd Kpresra , ' 
aya^ne-ric domain, ^ „ secre ™„ " I' 8 "" " , — "" d « ««- - 

aequence and a. leas, „„ e T"*" "** • -to* 

— «• I" y« amamer •Lta.T.S^'T*-* ' and is 

— and, a „ aIwalIy — encodes a CRSP 

Another embodiment of tu~ • Whence. 
** apecinca,,, de.ee. « SP — « acid moiecniea 

ending „„n. C Rsp prol< , ns Pof . d Native ,„ „ uclaic ac , d 

»*»* hybridizes under s t n"u2l °" ^""^ ' CRS <" — * cM 

*" ~ ~ 
M " sequence shown in s^O II Mn a SEQ ' D N ° : '' » ""''eorides 

-tad* nuance shown in s ^ '° or ,„ nucieoddes , -600 of 

•he nucieodde acne^e of 4= DNA in^of II arinee "' ~"- » 
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deposited with ATCC 

D *A i nS e rt of the „, " A ° Cession Number 98631 
5 «* nucleotide S e q „e„ Jl, C °"*"« » a noeie ' ' """^ » 

"*urin 6 in a Suiteb , ' " S ° " r ° vi<l « « meftod for Jl a v ™» of .he 

ln ^ther embodiment I 1 Prefera ^ * cysteinf ! ' *** Se ^ Ce 

« ta« ab„ u , 60 „ /o '".fcodimer,,, , CRSp * ^ ae,d se , U e„ee ofSEQ 

tamologous ,„ *, amj ; 0 ^ P " rott 'n <« an ^ acjd ° f <? » NO:5. ,„ 
'-"■o/ogous w a, e ami 7' e,n "» « amino acid seQ *' MMi »' Prefen-ed 
homologous ,o , h '" ^ "» """no «idL ° ano " ,a ' f™fcnwi 

^embodimen, of ,„ ■ NO * SEI > < D 

^oio goUs t0 . nucJeo ^ m ° ,ecu,e ^ing a nucIeoU, 1 Pr ° te, ' n " hich » 
nUCjeo ^ese 0uenceofSEQfD No ^;; u -eati eastabout 

• •«« complement thereof. 
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Another embodiment of the imL.- «■ * com P ,em ent thereof. 

CIU or me invention features an isolated rp« D 
encoded by a nucleic aciH^i , u • csaniso,ated CRSP protein which is 

Another embodiment of the inv,nf <■ ^ iU NO J ' OT acom P^nt thereof. 

cni or me invention features an isolated rosD- . • 
encoded by a nucleic aciH mn u i u • lsol ated CRSP protein which is 

eonditions ,o a L" add t T WM-ta 

The CRSP proteins of th. 2 COmp,ement ^reof. 

proteins. The invention further fel . ! ,0 form CR SP fusion 

anotner aspect, the present invention provides a methnH f„ ^ . • 

~" ta • -PC by contacting 1 wogi^l T"° 8 CRS " 

capable of detecting a CRSP mirl<»;« , 10JOg,cal sam P ,e with an agent 

presence of, CRS , ' -* «« *e 

biological sample. " m0 ' e,: '" C • Pr °' e ' n °'P°»P*P.id= la detected in ,he 

25 « nr -T 0 " provides a mah ° d «* *— 

Of CRSP Mi v| (y is ^ „ ^ oiolog^pte " ta 

CRSP activity in f """""^ CRSP "'^ -* *« 

»ivi,, ,„ i^r^rxi" ^rssn-r «- — 

embodiment, the agon, „ an andbody 1 sp "fi a " bT 
another embodiment the anent nj.,. " """^ blnds "° ' CRSP protein. ,„ 
««ni. tne agent modulates express on of CRSP t,„~„j , .• 
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J mi one embodiment, the CRSP m „A , RSP m °dulator to the 

; mb0dto -. *• CRSP tpodutotor* . Mid b ye, 

«*" «d «p resio „ is , deve , by abe™,, CRS p proKin „, 

n«P"~« invention also p ro v,de s ' a I "'"""^ °'*°««™ive disorder 

CRSP aenvi,, to 1 P ° M * "° d dete ™ i °'"8 -he cflbct „ f T 8 "" , '' ndici "°' 
y o, a CRSP protein. " y a impound tha, modulates Ihe 

Other featuresaadadvante.es of, h. • 
WWtog de,aited description and 2™ '" Va, "°" >* ^ •« <he 

25 tamanc^V^^'^^^oeandprodtote^. 

human cSV T* "* CDNA «d predict a™ ■ 

Figure 3 depicts the cDNa 

IDNO: , - 

human C^J T* "* ^ - Predicted an," • 

^ 71,6 nuc '^ide sequence correspond ^ SeqUence of 

-"Ponds to nucleic acids, to 702 of SEQ 
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' «Q .D NO:,3. T.e i^C^TT* ■ "* ' " ™ " 

ID NO: 1 4. 4 corresponds to amino acids 1 to 242 of SEQ 

the alignment of these sequences The Zl In generated f °™ 

•0 proteins of the instant invention ^ ^ ««"~» the CRSP 

domains of human CRSP Signal peotS T " nd functional 

rich domains of a CRSP 'Z TI ™* * * fiJ,ed ^oxes. The cysteine- 

NO.I7. s ^™ eM "«P°"ds to amino acids 1 to 349 of SEQ ID 

AF034208, respectively). 1 (AccesSl °n Nos. U3233 1 and 

molecules having certain conserved struck ^ and ' ^ ° f 

"family" when referring to th e proteintdT . " ^ 

intended to mean two or "ZT ~ * ad m ° ,ecuIes ""be invention is 

stmctura, domain Md havT—'I 0 ' aCid m0leCU,eS ' — « 

•he same or different species. For exam! . TV 6 "* ™ ta «"» «**« 

origin, as well as ^M^ZZSC ' ^ ° f 
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set domai -" — » ■ c RS p 

•oyacine-neh domain- is . „ ro l ' ° "T ^ « ^ «*»«. Pn*^ . 
5 «*> — -.ds, — - . of abou, £ 

acids, more preferably abou| Kld J — P—Ny abou, <0-90 

wluch at leas. abou. 3-20, preferably .boa. si, ' 65 aram0 ° f 

1 0 A prefetred cvs,eto, • I T ° yMim! resid "as. 

* - -.boa, ,3, , 6 , ,, 

position as ,he cys,ei„e residues to taraul C^p ?„ a! I*"" ^ acW 

- in Figure 6 , CRSp . 2 ^ « ^ 'ving SEQ ,D NO:2. P„ r example , 

araia,, aai<l cysteTnettoue T" reSidUeS l<>Cattd * ^ ~ 

" NO:2 (e.g., cysI5 , in CRSp . 2 ^ - braraa, CRSP-1 having SEQ , D 

posi.io naa cys21 , in CR ^ • • '""ted a. (be » relative am/no acid 

located « tbe same re,a,ive amino i d 2 I '" CRSP " 2, ' D N °' 5 ' iS 

andc ys ,57 to CRSP-2 SEOID NO , T " CyS2 ' 9 CRSPI - SEO » NO:2- 
posidon as cys220 to CRSP-1, SEQ ID NO^ Sto f,™" """^ ami " 0 "« 
20 te « leas, ab ou, „ cyaKine ^^Jf**- " *— 6, CRSP-3 
■be cysteine residues to human CRSP Z c^f™ a " ,in ° Kid <* 

«. CRSP-4 bas a, .east *»7£££SLT J"* * * *°™ 
Posidon as fc aysttine ^ ^ ™ s 7 « *■ — amino acid 

A preferred CRSP preteto of " 8 SEQ "> N0:2 ' 
« eneoded by . nucleotide C '" Ven,i0 " " * ^ tei » 

For example, i„ one era J m "^1 „ ! ™»^«-»« gene., 
cysteine-rich domain oonrainin X uTaLi T c0 " Bi " s ■ *■ 

cysteine residues, and a scooTc^^T ' 47 -' M ^ '° N °' 2 . ^in, ,0 
2« of SEQ ,D NO:2, h a vi„g 1 0 ~^in e ab o„ amino acids 201 - 

30 are depicted in P,gure d, ^1"^ c^ " ^ — 
conratna a fe, cyataine-rich doraran conraini™ !h f ' CRSP - 2) proKi " 

^^■•^^jrrit^iT ,4, " 9oofsB9,D 

am.no acids 41-218 of SEQ ID NO-5 navi„» in '" e " nch don,al " containing about 
cystine residues ore depicred in Kg re 6 ,„ IT™ I"*" <*" ""^ ° f 
» (branan CRSP-3) conuins . fira » «SP-3 p rott to 

138 of SEQ 1DN0:8, having lOcys^n^resjduM^H 601113111 ' 118 about a" 1 * 00 acids 85- 
con^ining ahon, amino acid! 1 82-2« s"q 2 d »ain 

V NO - 8 ' havin S 10 cysteine residues 
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main containing about amino ac ids 96- 1 76 of S Fn in kf^ , , , 
P^ferably .avfaZ ^ « more 

« leas, two cysteine-rioh domain, and h„. P Which includ « 

''aoout 15-20, and more preferably about 16 17 18 „ 10 r... . 
are cysteine residues. i n , D _ femd _ ' l7 ' or 19 of the amino aeids 

C-temrina, region of a CRSptrem F^a i ' " >" *• 

15 protein ^to.cy^J^T F ° reXm, " e ' °" e «»*»*«*, a CRSP-I 

NO:2, baving 20 cy^^LT^* ab ° U ' ^ **" ' 4 ™ °"EQ ,D 
embodiment a dS^S Itt " H """ 6 ' >" — 

acids „, of SEQ ,D ^ Z^o" ^ " ^ — *»• — 
Figure 6. ,„ ^Je^^r " - <■ 
20 containing about amino acids 85 26™ D Mn TT™ ' ^ ** "S^" 

cya erne nob region chaining about amino aoids 1-176^0 m NoTb '" ^ ' 
cysteme residues a, the positions indicated in Figure 6 ' ™ 8 ' " 

includes amino aeid residues locared C , ' tT CyS,eine " ri,:h re8iM 

.emrinal ,„ d,. second cysteine „"h d^ T a ^ ' CySKine - ri<!h d ° mai ° <*>« 

more preferably about lun,™' • . ^craoiy about 1 0-65 amino adds, 

.O NO* aad CRSP . 4 prolein cTnCZ ^ftl^ ' 39 -' 8 ' * *> 
SEQ ID NO: II. g about 30,1110 acids 48-95 of 
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herein, . -^2^7 ■ ^'"'"^ 3 ^ "<"«• A * — 

occurs at JS3T rf « ' T" """^ *" 2 ° ^ " ids ** 

preferably about 20-22 amino a HH / eS,dueS ' and more 

■o (c, r „ e , Valtae , u^, .oLt cit p T^rcr r ues 

oaiitct a protein containing such a sequence to a lipid bilaver Fn- 
one embodiment a CRSP 1 ouayer. For example, in 

23 of SEO rn mA., l." r° . * Signal se ^ ence of about amino acids 1 - 

i-v n anothe r embodiment afR9P?nr^ • 

'5 sccuenccofa^^no^ 1-19 of SEQ ID NO-T In^u," C01 "*"". " sig ""' 

3 Protein contains a signal seoucnc „f .k ' eml >°°™>ent, a CRSP- 

fan,,, zi™::zr M inv r on ca ° * - • «*. 

d. n Xnl ^cn^r^ h0m0 '° 8y '° ^ "-«" 

» — a Protei „ - jn^r^*rssr r on ° f crsp - - 

is depicted in Figure 5 The mml- ° herem 33 " C RSP-like-N" 

MO. 13 a coding region containing nucleotides 75-800 of SEO ID NO,V , 
25 to nucleotides I -726 of SEO ID NO- K. a ,, Sty 1U Na 13 (corresponding 

-** aOI-928 of SEQ ID NO:] 3 ' « 

leas, on^l~!7' Z ** 7°"°" *— ' °» "~ ' ^ a, 
embodiment features a CRSP nrote ^ " ' ~ "*»■ A °°«- 

35 CRSP-2, CRSP-3, or CRSP-4) ? ° f *" mVenti ° n <"*- CRSp -l> 

Yet another embodiment of the invention features a CRSPnmt.- u • 
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peptide. AnoUter embodiment feantres a CRSP protein having a, leas, one cysteine-rich 
domatn, preferably a, leas. „„ e cysttine . rich region _ 6 ^1 

lea urea a CRSP protem havtng a< teas! one cysteine-rich domain, preferably a! las, „ ne 
5 y*e,„e.nch region and a signa, peptide, wherein me cvstei „ e . ricl , P region ~ 
teast two cysteme-rich domains and a spacer, ncruaesa, 

Preferred CRSP molecules of the present invention have an amino acid sequence 

.0 2.? r ' " r Q ° N0: "' AS ^ he ™"- 11,6 Km "^-«y henrolols- 
■0 refers ,o a firs, ammo acid o, nucleonde sequence which contain, . su ffl cie „, " 

mtntmum number of idenriea, or equivalent (e.g., an amino acid residue wh ch L a 

— Chiin) reSidUK ° r » « —d aminol id „r 

nucleonde sequence such th„ the fits, and second amino acid „, „„c,e„,ide sequen£es 
share common streotura, domains and/or. common functional activity. ForexaTpTe 

to, about 40% homology, preferably 50% homology, more preferably 60%-70% 
homology across ,he amino acid sequences of Ore domains and contain a. leas, one 
preferab ly ,w„, more preferably rhree, and even more preferably four, five oasx 
stromal domains, are defined herein as sufficient,, homo,og„ns. Furthermore amino 

i^t:; homT^r " share a ^ ■ ■** - — - 

CRSF-! S -7 d i " ter : han6eabl >' herei "- ' "CRSP activity", "biological Ktivily of 
CRSP or fimettonal achviry of CRSP", refers to an activity exerted by a CRSP protein 
» PC ypepttde „, nudeic acid m„,ec„,e „„ a CRSP responsive cell aa deteLn^T 
or m vrrro, accordtng to stendaro technique, ,„ „„ e embodiment, a CRSP activity i ,' 
tree, acttvtry, such as an associate tvim a CRSP-teage, mo,ee„,e. As used hereL a 

suTrc^med :°d'r u,e w " h which a crsp protti " bbds - 

0 CRSptn? , ™ °" ' S aC " eiVed ' A CRSP "*« m °'^ <*» >* a non- 

exZ, t h' 3 Pr °' ein ° r P0 ' yMMide ° f *• <™°* -—«. >» - 
e* mplary embodtment, a CRSP huge, molecule is a membrane-botntd protein (e g a 

- U -^^orXRSP«cep^, M ,mcdif^ fonno , s ^ a ^:^ h a 
has been altered such ma, me protein is soluble ,e.g„ recombinant* produced s «h L 

1 CRsTT T ° 0 ' T" ' —* enrbodimt,? 
CR^P targe, , s a second sombre protein molecule (e.g., a "CRSP binding partner" „ 

CRSP subtree"). In such an exempt embodiment, a CRSP binding Lne, clbe a 
second S olub,eno„-CRSPpro K i„ora S eoondCRSPpro,emmo,ecn,.„ e C^r, 
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Alternatively aPPoo 
urrace receptor e o * , g *> fluman CRSP ^ oa P r °teinor 

r „ '» « Pmfened emoodi ' h * ere,n - r ""' 0 " «ny of 

5 , «**r for a member V* - ■ CRSP receplor , , J* «"«» *«, ,„ . ^ 

molecule) • «n • ™ w ftmilv of .• " *>">> of a crcd 

mam . '• ( "> , »««clio„„f aCR „„ yofsi «"^«8 prolog ... CRSP ~ep tor ,a 

, "Nation of f») ">terac,i„„ mlh --J RSP molecule 

-d*^* * mo.ec* , ' ' * - * 

^of^^ 

of «,,.«, " "° n): ® memalion ofco JT ""I***., of w 
of °" " n,0du,a, « i "y Hiding ofc ^'® of exp ressio „ £*' 

«« *» S crip ti „ n I „ ac e„ 8 -' m<,,,C,ion rt^dU^T^"**™** 

'^ration as in the case of c Action 
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5 stimulation „f surviva ,. P gM '° r ceB • "») "-odulanon of cell death, such as 
modulation. ' ( " )re8Ul " me<:ell '» i e™i<>n^nd/„ r(12)imrauile 

K, exposed a, higher ^ ^V** proteins are 

undiffere„,ia,ed ,„ the m^ij^^^ * — - *» h 
during regeneration of ^ P ™>»** developmen, or 

15 undifferentiated eomtcrparts Al« ^*ntiated calls as compared lo their 

Acco ra i„ 6 | y , ^ embodireao, „f IT "'^'""on-spaciflc proteins, 
protems and po, y p e p, ides having a CR P ° .' h "7"°" **~ CRSP 
■« one c^oe-rioh region 2 a CRSP a 'T <*" <"° U >™ « 

20 Che CRSP protein has a, L TJ^L T*\ «*>*«. 
-Prises a, leas, one ^-J^tTL"^?? ^ * 
embodiment, me CRSP protein has a, leas, 1 y ' a " 0ther P refe ™< 

*— «* region compriaes - ta^ZTST ** "» 

■« W another prefer embodiment a CRSptT a ^ * CRSP 
» *,ue„ce. ^^Z ^^Tc^ ^' 
reg,o„, a CRSP ac,iv ity , and an amino ,<J Pr< " ein " aS « 

— -id sconce of SEQ^o" So m^T""* » an 

NO:ll. V • SEQ,DNO:5 .SEQIDNO :8 ,orSEQID 

The human CRSP-1 cDNA wh,Vh « 
» encodes a protein whi eh is apprexta^T , "™°*™°<y 2479 nucleoddes in ,e„g lh , 
human CRSP protein ^^^ 7 "*' " The 

for example, frem ab0 n t m^JL^^tT" ~ " ^ « 
eyateine-rich region comprises a firs, cysteine ! 5 '° N °' 2 ' 71,5 CRSP " ' 
» '05 of SEQ ,D N0: 2 J. second"^ "Ira *T? *» *» — -* .47- 
284 of SEQ l D N0:2 . n, humM cr", ""t ° ma, ° *° ra ab0Ut ^ acids 201- 
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The human CRSP-2 cDNA wh.Vh • 8 en " 8 10 ''- 6 )- 
encodes a protein wh.VK- ,NA > Wn 'ch is approximately 84S n, i •. 
5 human rpcD ,S a PP ro *imateIy 224 amino 7 nuc 'eot,des in Jength 

numan CRSP protein N-termin f "** reS,dUes in ,e <*th The 

The human CRSP-3cDNA wh; h J 9 of SEQ ID NO:5. 

encodes a protein whirl. • ^' Wn,c h <s approximately l5?o n , , . 

'38 of SEQ ID NO -8 domain fa™ .1. 3 

2" of SEQ ZD Na 2 t T"" 1 *-* Clf an " n0 " <: " i5 M - 

20 U 2, 11,6 human CRSP 7 „ uirom about amino acids 1 S? 

The human CRSP-4 C DNA wh- u • f SEQ fD N 0:8. 

encodes a protein which ,S ^Proximately 702 n,,.i 

One aspect of the invention „ - • 
-ode CRSP proKjns „ bj « ~ <o isoUfcd niKleic ^ 

mutation of CRsp ^ for as P C R " f ^ nucJe " 



WO 98/46755 

PCT/US98/07894 

- 15- 

molecule" is intended to include DNA molecules (e . pnwi 

RNA molecules (e g mRNA^ a „H , T 8 " A ° r gen ° mic DNA > 

shanded, bu, preferably is double-slm^ DNA Sm8k - S,ra " ded « *«► 

"isolated- J£1~E? '" " a "" i " """ ° f *" " UC,eiC aCid ' - 

orgamsm fen, which „,, nuc|ejc ^ teiv ~ ^ « »* 
10 the isolated CRSP nucleic acid m „l. I ^"""""P 15 . '» various embodiments, 

' * 0.5 icb or 0, To nuXr' l T" *" 5 * 4kb ' 3kb ' 2kb 
molecule in genomic DNA o f Th TT" ""^ "* ° UCkic «" 

» ^ated^rm :r i:::r N r r; acid is * - - 4 m — 

of other cellular material or cult,, !r ? "* ** «*™«Wly fee 

synthesized. cnemicals when chemically 

Ote DNA insert of the plaamid deposited with ATtX^as A "If** 

portion (hereof can be isolate T Accession Number 98452, or a 

-*« InfonnTon ^ h^ta ullT'^ - te 

nuance of fc DNA insert ofthe pI aamid depo L TaTO as a ° 
°S634, the nucleotide sequence of lha nua • . Accession Number 

« Accession Number M6mT1 T a P,aSmid dep ° Si,ed " ith 

ph>smid deposi Jwi h Ircr a DNA tase " ° f «* 

H^rUhom^^cXr;^^^--"^ 
me DNA ,nsert of ^ P'asmid deposited with ATCC as 
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Accession Number 98633, or the nucleotide sequence of the DNA * 

deposed with ATCC as Accession Number 98452 1 h T ° f PJaSmid 

chain reaction (PGR) using synthetic oZ ■ I ,S ° ,ated by the P °'* mer ** 

sequence ofSEQ ID NO: ' ^^T^T^T ^ « * 

5 nucieotide sequence of the DNA nser^ * > ' " ' EQ IDN ° :, °' 0r < he 

Accession Number 98634 1 ■ „ P ^ dCP ° Sited With AT CC as 
u«ioer y»oi4, the nucieotide sequence of the nw a • 

depose with ATCC as Ntunber 98633 or t t "* PhSmid 

DNA inset, of lhe plasmid !' *" " UCle0,ide «* *» 

A nucleic acid of the hL? c CCeSS '°" Number 98 "52. 

>0 alternative,,, emo ^ A "l 7™* ^ ** « 

be cloned into m approprjiK ^ ^ e nucletc acd so amplified ^ 

Forrhetmore, o,ig„„„ cleotjdes co^^rTp 17^"" 
prepared by standard svntheti, wi. ""decode sequences can be 

compnaea the nocieoUde sequence shown i„ SEQ D NO ' 7 " "* i"™"" 0 " 
NO:i corresponds ,„ fi* human CRSP-I cDNA tZdna * SE ° '° 

encoding me human CRSP. I prote i„ ,t e ... h , ' ' A C ° mpnses «l"»«s 

' 087). as well a s 5' untranslated s,Tu , ™ 8 " !S " ,n "' fr ° m nud «"«« 38- 
0 sequences (nucleotides ^S~?T" k ' " * 

corresponding ,„ SEQ ID NO-3) A ~r " ucle< "«'« 38 to 1 087, 
sequence encoding cRsp., 

(ATCC,, Rocltville, Maraud, on Z U It 7^" ^ °— C °' tai °" 

; M452. A plaamid ^ ^e„l , aSS ' 8 ° ed 

*T 16 - ">« - -igned Accession Number 98634 '"^ M " 5 "-* » 

<nv Jn^CZteTut;:,?" 1 ™- " ^ —» «* — * 
SEO m »o, nucleotide sequence shown in SEQ ID NO-4 Th. 

SEQIDNO.-4co msp „ n d slolhehunlal , c -n, „° ">= »q"ence of 

nuances encoding me human CRSP-2 proJin (i e Z JT **" 
nucleotides 126-796) as well „. <• . , (l e " ,he Md, ng region", f rom 

~ed ^:^™™zir s ' » ,2s > - * 

molecule can comprise only the A " en,a " vel >'. nucleic aeid 

nucleo.,de sequence shotvn i„ SEQ ,D NO:7. The sequence of 
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SEQ ID NO:7 corresponds to the human CRSP-3 cDNA. This cDNA comprises 
sequences encoding the human CRSP-3 protein (i.e., "the coding region" La 
nudeondes 93-890), as we,, as 5" untranslated sequences (nucleotides , to 9 2 rand 3" 
untrans ated sequences (nucleotides 89, to ,529). A,ternative,y, thenuc.eic acid 
molecule can comprise only the coding region of SEQ ID NO:7 (e.g., nucleotides 93 to 
890, correspond to SEQ ID NO*). A p.asmid containing the full-.ength nudeot de 

:z~z m rsp ; 3 7 deposited with * Americ - t - «i coiiir; 

98633 ' ° n ^ ,6 ' 1998 "* aSSigned Accession N -*er 

In another preferred embodiment, an isolated nucleic acid molecule of the 

s7 Q ™0 P r T ,e0tide S6qUenCe ShOWn ^ SEQ ID N ° : 1 °- ™ e ~ of 
SEQ ID NO.10 corresponds to the human CRSP-4 cDNA. This cDNA comprises 

sequences encoding the human CRSP-4 protein (i.e., "the coding region" from 

nuclides ,-537), as we,, as 3' untreated sequences (nucleotides 538 ^02) 

NoToT, I T 3Cid m ° ,eCUle 0311 C ° mPriSe ° n,y thC COding «*» of SEQ ID 
NO.10 (e.g., nucleotides 1 to 537, corresponding to SEQ ID NC-12) 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
sequence shown m SEQ ID NO:,, SEQ ID NO:4, SEQ ID NO:7, or SEQ ID NO 10 the 
nucleotide sequence of the DNA insert of the pl asm id deposited with ATCC as 

deposited With ATCC as Accession Number 98633, the nucleotide sequence of the DNA 
« of ft. plasmid deposited with ATCC as Accession Number 98452, or a ^ on of 
-yofftese ^nucleotide sequence, A nucleic acid molecule which is compleme^ t o 
ID NO.10 or the nucleotide sequence of the DNA insert of the plasmid deposited with 
o^Imidd ,T- N r bCr 986H ^ nUC,e ° tide SC<1UenCe <**" DNA insert of the 

ofT^T Tu ATCC " ACCCSSi0n NUmber 986 "' ° r *" — ide ~ 
of the DNA insert of the plasmid deposited with ATCC as Accession Number 98452 

NoT^ID^or t0 nUCle ° tide sec l uence shown in SEQ ID 

Tdma ?l J Q ID N0:? ' ° r SEQ ID NO: 1 °> ° r *• nuc,e ^e sequence of 
the DNA insert of the plasmid deposited with ATCC as Accession Number 98634 the 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as ' 

deposited w,th ATCC as Accession Number 98452, such that it can hybridize to the 
nuclide sequence shown in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO or SEQ ID 
NO:,0, or the nucleotide sequence of the DNA insert of the plasmid deposited wmn 
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60.65%, «, even more „ ,J * U 0 ^ ™ more preferably ,b„ ut 

homologous to lire nucleotide „„,, v ' °" 85 / "- or ">ore 

-0 ID NO:7, orSEQ ID N TTZZZT" " "* '° SK > ' DN °* SE Q 
^ . ,menucleotlde sequence of th^HM a • - 

deposed with ATCC as Accession Number 98634 *! , ° f ^ PlaSmid 

^ert of the plasmid deposited with ATCC L a ^ ° f the DN A 

sequence of the DNA insert of Trl H , °" 98633 ' nuc '«>tide 

osm he P ,asmid depos ted with ATPP a 

98452, or a portion of any of these nucleotide sequences ^ 
Moreover, the nucleic acid molecule of a. • 

NO:,0, me nucleotide seance of L DNA ^ '° SEQ '° 
^ Accession Number 98634 the nucleotid, P m ' d d<!P ° Si,ed *** AT « 

deposited wilh ATCC as Acee^oTNumber ^8633^ "th^^ ms art of the plasmid 
» DNA insert „f me plasmid deposited N ^ r T 9 ^ « """"ride -*« of me 
example a fctgmeut „„«„ ^ ured a!a " NUmber 98 " 52 - ** 

biologically active Pomon of a CRsCteT^e T"" " fa8men ' MMdi " 8 » 
from me cloning of me human CRSP . ,', SeqUe " Ce de " ! ™"od 

Primers designed for J 1^ ""^ for "* •"»"*■» ■* P">bes and 

' e,., ^ omerT s ::r::~7 cMp f —^- * — - 

oligonucleotide tynreallv mm • P oligonucleotide. The 

about 40, 50 or 75 consTu v ' *« * — Mr 

B> NOt4, SEQ ID NO:7, SEQ ,^0 0 */ T'-'T" '° ^ SE< 3 

•he plasmid deposed w th ATCC as A « "** DNA of 

o^DNAin^ofmeZTd ZZZl^T' ^ ~ 
•he nucleotide sequence of the DNA in Jot Zr^ 1 1 " ' 8M3 ' 

Accession Number 98452 of an an* ' eP ° Si,ed Wilh ATCC «■ 

Plasmid deposited ^TCC f *" ° NA ° f 

- ONA insert of me ^SSZT^^ ~» ~ of 

101 ATCC 35 ^cession Number 98633, or 



W09».| 6 7SS 
*e™cle„, ideseqilenc 

Accession Number , "«n of 4e , 

5 *•»«* insert of theplas^jj cess ' on Number °S634 (kg ? ^NA insert of the 

Pr °°e s based on the hl 475 SE Q © 

""P'ifieation (e . R r 1^°"°'*^ honj^. b ° mob **»- h a 
"""Prises a 0eOUS Prote ^ 'n P^ST M 8e °° miC ^"C" 

a p- of . dia^os,,'. tes irf™' - - •»-. «*jrs a radioiM, °<*- ■ 

P'otein, such as „ k " f °' "mUto* cells or , f f Probes «■ »• osed as 

Protein" can k ^g^ent encoding a "h,„. • 

"foe plaamid d Num ^ '8634 L , " W of *= P'«nd 

v/Vz-o) an w Cncode d Portion of the CRSP have Previously been w 

CKSP Protein. 
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NO,0, th. nucleotide se , uence « ^j«Q IDNO:4. SEQ ,D NO:,, SEQ ID 
as Accession Number 98634 the ^ P ° Smid "cposited wilh ATCC 

<* *• code and J?™ °" ^"T" N ™"» '"52, due ,o 

encoded by u» nneleoM. * na> * * e 83,116 <*» P^ins as .hose 

™Q.DNO:,0,,he„™^^^ 

'° dep ° S ' ted »• «CC as Accession hJL^ST^,"" '* *" PtemW 

■nser. of ft. pksmid d ATC c LT ' UC ' e °' ide "t the DNA 

nucleotide sequence of*. DNaTs.^ f CeSS " > " 98 « 3 ' « «" 

Accession Number 98452 In tmoT ^ deP ° SiKd "*> A1 "CC as 

*= i„ve„,i„ n ^ a nucleNjde -oleic acid mo,ee ul . 0 f 

' 5 *=9"ence shown in SEO m 8 proKi " an amino acid 

I" addidon .0 J. hn m » CRsp 2 ' 5> * ^ °' SE » ' D N <» 
SEQ ID NO:4, SEQ ,D NO:7, SEQ ,DNO Z * "T" Sh ° W " *" SEQ ' D N0 ^ 
«-« of dr. plasmid depo$ittd « ATCC as^A ^ Se,Ue " Ce ° f "* DN * 

sequenee of ft. DNA insert of ft. J^ZlZ "T"" """^ 98 " 4 ' "» ""cleodde 
» «B3. or the nucleotide s^T^ 'Z T *"* " ACMSSi °" 
ATCC as Accession Number 98^32 i^mb "* P ' aSmid «» 

DNA sequence polyntotphi,™ tha, i d " £?TT ^ ** ^ *• « «- 

polymorphism in lhe C RSP genes ^ exis al 0 t d T ^'"^ S «"> 
5 10 ™°°» allelic variation. As used hl7„ hTT ^ ° ""a 

Wcally result fa Ms variaice ,/. Pr0 '" n -. SuCh Mtu ™l allelic variations can 

^^^J^'^^^T''^^. Anyattd . 
*a. are the result „f ^ ^ J™'* ^'™»n>Wam S in CRSP ge„e s 
of a CRSP protein „ in , ended «■ *■ do no, a„er the fitnctiona, activity 

Moreover, nuclefc acid 1Z ^ ° f invemi °"- 

- *"* which have a JZ^^^?* «*■ - o»ar species, 
SEQ ID NO: , , SEQ ,D NO:4, SEQ D NO 7 Eo ^ ** ""^ <* 
of - DNA inae,, of the p lasm id ^ ™ ~ 

*e nudeottde sequence of the DNA inscn of thetlsl d 5 '°" ""^ ' 8634 ' 
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deposited with ATCC as Accession Number 98452 ■ 

nucte 0 t lde sequence of human CRSP-, Th ■ *"* been identifi ed based ofL 
(SEQ ID NO. ,6) encodes a CRSP , D L -quence of murin e C ^p , 
5 -ino ac ld sequences Qf ~v lng 349 ^ aC] ,, 

region of murine CRSP i ; c 1 de P icte d in Figure 8 ru 7 

Nucleic aciT'!; IS , rePreSented bv SE Q © NO:,8 § * ^ ^ 

««ue. cDNA encoding a CRSP ymUS ' ,ym P h "odes and infl am 

isolatm- ♦ * , g RSP P rot e n(ee aTR^D . w flammatory 

-o at total mRNA f rom . ceIJ , e fl ^ f S ^ P«»ei„) can be obtained b7 

15 ce,, > Ending embryonic cells n t 6 ^ a mami "aJian cell or , h 

total mRNA an M I ° uble strand ed cDNAs M „ >u L human 

Mention is at Iea S M t^T" *n Elated nucleic acid , 

th~ , ■ 5 nuc, eotides in leneth u • ld mo, ecule of the 

25 ^-^^^^^^ 

25 Plasmid deposited with ATCC a mCk ° tide Se <^ce of the DNA Q ° 

^e DNA insert of 2 , ACCeSSi ° n Num ^ 98634 the n , ^ ° f the 

^ nucleoid! * dep ° sited ^ ATCC as Ac!' ^ SCqUence of 

nucleotide sequence of the DNA insert r>f tCi essi ° n Number 98633 or 

Accesszon Number 98452 In other I T, * ^ dCp ° Sited ATCC „ 

8 h yWdizatiM 
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in 6X sodium chloride/sodium citrate . ^ 

washes i„ 0, X SSC, 0. , H SDS " O^C f °"°™ d ~ " "« 

molecule of the invention tha, hybridizes imd, 7 " iS °' att ° " UC " ic Kid 
SEQ ID NO:, corresponds ,o a naT^o- ""'"'^ "» * 

' herein, a ^yl^"™^^ """^ As — 
molecule having a nucleorinf ° leC "' e " tm "> » **A or DNA 

protein). 8 """^ — ■ » _ (e.g., encodes a 

>0 *^*^n^Z^ZZ '''^^ethatehangescanb. 
SEQ ID NO.-7, SEQ ID NO10 th * SEQ ' D N0: 1 ■ S EQ ID NO'4 

Plasmid deposit* as A ^ °' "* DNA of the ' 

*e nucleotide sequence of the DNA I„J It , ACCeSS, °" Number »>«3. or 
'5 Accession Number 98452 ft^T ^ *">° sited ATCC as 

-coded CRSP proteins, j° * — - ~ 

For etutmple, „„ cleo „ de substi e "^" C " OMl al "'">' •* <"* CRSP pro,eins. 
conservative amino Kid substj ~ a """° — *W« MH 
™de in the sequence of S£Q ^ " "°"™ - tesidues cau be 

20 .he nucleotide sequence of the DNA^er, of h e T 5? '° ^ SE ° ' D NO: ">. 
Accession N„ mb er 98634 the nucleoZ , P ,d "^'^ wi ™ A TCC as 

«w i d.ATCCa S A^~bT9^ 0f t DNA ^ ,,0f,hePhSn > i< ' 
^^of^.pIaamiddapos.JCATCc'T T^*"*—^ 
essential- amino acid residue is a residue .h i ? °° Numbar 98 « 2 - A W 

^'O^ut^^bioJJ^^^OfSEQIDNO^orSEQID 

residue is required for biological acuvitv P " Wmia '" ami "° a "« 

conserved a mong u, e CRSp ror exam P ,e . <™>™ acid residues that are 

within cysteine d™J^l^l ,T " m,n "° n **• ^ 
3» Funhettoore, antino " ° ^ i " , ' ar,> ' — * » 

proteins haviog cysteb e- rich Z n I Z H CRSP P "" eta - <*- 

Acting,* another ZZZ? ' * "— * » a,K ">' ">"• 

essential for activity Such C^P * """ n "* > add rcsidues *« are no, 

one embodiment, the isolated nucleic acid ™L . rCtam bi0, ° 8iCaJ aCtivit y- 
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60% homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID NO 5 SEQ ID 
NO: , or SEQ ID NO: ,1 . Preferab.y, the protein encoded by the nucleic acid moleie 

85 Qno/ k 1UNU - 8 ' orSE Q ID NO:Il,evenmorepreferabIy atleastabout 

85-90 /0 homologous to SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, or SEQ ID 
NO:l 1, and most preferably at least about 95% homologous to SEQ ID NO-2 SFO in 
NO:5, SEQ ID NO:8, or SEQ ID NO. l 1 . * Q ° 

10 , • ^ot 0 ^ nUC,dC m ° ,eCU,e enC0ding 3 CRSP P rot ™ homologous to the 

r 8 ° r m ° re nUde0tide SUbstitutio -, additions or deletions into 
the nucleotide sequence of SEQ ID NO,, SEQ ,D NO:4, SEQ ID N0.7, SEQ ID 
NO.10, the nucleot.de sequence of the DNA insert of the plasmid deposited with ATCC 
as Access Number 98634, the nucleotide sequence of the DNA insert of the plasn^d 

DNA msert of the p.as mi d deposited with ATCC as Accession Number 98452 such that 
one or more ammo acid substitutions, additions or de,etions are introduced into th 

mZn 7 P l°T MUtati ° nS ^ intr ° dUCed im ° SE Q ID NO:1 « SEQ ID NO-4 SEQ 
ID NO:7 SEQ ID NO: 1 0, the nucleotide sequence of the DNA insert of the plasmid 
-0 deposited with ATCC as Accession Number 98634, the nucleotide sequence of Z DNA 
msert of the plasmid deposited with ATCC as Accession Number 98633 or the 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 

PCrilt 7 be ; 98452 : ^ Standard tCChniqUeS ' SUCH 3S ***** — -d 
5 It one " a86neS,S - Preferably ' COTOive ™ n0 acid subs ^utions are made 

5 at one or more predicted non-essential amino acid residues. A "conservative amino a id 

esidue having a similar side chain. Families of amino acid residues having similar side 

chams (e.g., lysine, argmine, histidine), acidic side chains (e.g., aspartic acid, glutamic 
> acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine 
threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine 
.soleucme, proline phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine 

m a CRSP protem (e.g., one no, located in a cysteine-rich domain) is preferably replaced 
another embodiment, mutations can be introduced randomly along all or part of a CRSP 
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eened lor CRSP b.oIog.cal activity to identify mutants that retain activitv f«h • 
mutagenesis of SEQ ID NO: 1 , SEQ ID NO:4, SEQ ID NO 7 SEO ID Nmn u ' 

5 Accession Number 98634, the nucleotide sequence of the^DNA insert T ■ 
deposited with ATCC as Accession Number 98633 Jl t 

^^^^^^^^^^^ 
um-ceUutar calcium, an increase in phosphatidylinosito, or other moSe L 

. • 1 8 *' CKSP- 1 ) to a receptor; (4) reeulatp 

20 maintain oiyW^h *«o*;„i ' * 0 ' lorm and/or 

aintain ordered spatial arrangements of differentiated tissues in vertebrates- m 

coding stra „ d of . nucleotide ^ crTp^T '7 "* ° f "* 

region of human PR qp i ^ j iu oiiy iu MU.6, the coding 

g ot human CRSP-3 corresponds to SEQ ID NO:9, the coding region of human 
CRSP-4 corresponds to SEQ ID NO- 12 an dth*^A- ■ c B.°noinuman 

V iv inu.12, and the coding region of human CRSP-like-N 
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corresponds to SEO ID NO- is^i i» *u 

-ota* is mJLVSZLZ 7**—..*. -ta. nudeie acid 

^enceencodingC^P "w"d 

■ . • — 6 ineterm noncoding reg on" refers tn v a ^ -j. 

which flank the codine reomn tu~ t 3 sequences 

5 -5. Md 3 . „„c a ,~r " oi " im ° -*» — *• 

Given the coding sinmd sequences encodinc CR9P Hi„i Jt . 
NO:3 - SEQ ID NO:6, SEQ ID N0: 9, or SEQ D Z " ^ SEQ 

ofCRSP mRNA. An ■JSnE^^r"** ** " am ' i " i0 " * 
25, 30, 35, 40, 45 or 50 nucleorides " "en^ A r °«™«P>°. ebon. 5, 10, ,5, 20, 

15 can beconsrrnced ^_~T . "TT ™ ■»«** acid of ft. 

invention 

oligonucleonde) can be cnli Z>7Z^ T """^ ^ <e ' 8 " " 
various* nidified nncilZt , na,Un " y ° CCmi "8 - 

moiecni or to ££££ ^T^."^" "* bi< "° 8ical !Kbili » ° f *■ 
20 annsense and s^Z^^ T'" * »»" *• 

tinned nno^rl T ph0S > h °™ hi °«« "varives and acridine 

cMorouauei,, 5,^,, hyp0MMhillft 4-ac.~e 5 • 

^^Cn^^^~- 
7-methylguanine, 5-™^'^^ 5 - me%,Cyt ° Sine ' 
beta-D-mannosylcueosine 5' ^ 7 ^^^'"^ethyW-thiouracil, 

30 -e, hylW „. N lLX^ 

pseudouracii queosine ? thi . • oxyacetic acid (v), wybutoxosine, 

S-methylu.c-i ^^^'VT^^' 
S-me^-thiouraci,, l^lT^Tt ' - Ml 

2,6-diaminopurine. AltemaZT '■ N " 2 ; Carb ° XyPr0 ^ Uraci <> 0*3*. and 

35 biologically using an exZ ""^ "* Can be P roduced 

s <*ny using an expression vector into which a nucleic acid h« k- 

an antisense orientation (i e rw a , . . has been Cloned in 

ntation (i.e., RNA transcnbed from the inserted nucleic acid will be of 



WO 98/46755 



PCT/US98/07894 



-26- 



35 



an antisense orientation to a target nucleic acid of interest, described further in the 
following subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
5 and/or genomic DNA encoding a CRSP protein to thereby inhibit expression of the 
prolan, e.g., by mhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for examp.e, in the 
case of an anUsense nucleic acid molecule which binds to DNA duplexes through 
specfic mteractions in the major groove of the double helix. An example of a route of 
10 administration of antisense nucleic acid molecules of the invention include direct 

injecuon at a tissue site. Alternatively, antisense nucleic acid molecules can be modified 
to target selected cells and then administered systemically. For example, for systemic 
administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense 
nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or 
antigens^ The antisense nucleic acid molecules can also be delivered to cells using the 
vectors described herein. To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the antisense nucleic acid molecule is 
placed under the control of a strong pol II or pol III promoter are preferred 

In yet another embodiment, the antisense nucleic acid molecule of the invention 
is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary UNA in which, contrary to the 
usual p-units, the strands run parallel to each other (Gaultier et al. ( 1 987) Nucleic Acids 
Kes. 15:6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o- 
methylnbonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a 
ch.menc RNA-DNA analogue (Inoue et al. (1987) FEBSLett. 215:327-330) 

In still another embodiment, an antisense nucleic acid of the invention is a 
nbozyme^ Ribozymes are catalytic UNA molecules with ribonucleic activity which are 
capable of cleavmg a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to 
catalytically cleave CRSP mRNA transcripts to thereby inhibit translation of CRSP 
mRNA. A nbozyme having specificity for a CRSP-encoding nucleic acid can be 
designed based upon the nucleotide sequence of a CRSP cDNA disclosed herein (i e 

IT' 10 N ° :4 ' SEQ ID N0:? ' SEQ ID NO:10 ' * e -quence 
of the DNA insert of the plasmid deposited with ATCC as Accession Number 98634 or 

the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as ' 
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Acce Ssion Number 98633) p 

«" be coveted in which the' 2 

to nucleotide sequence^ T^** Se <^ce of the ^ 9 ' VS *"A 

A/KmaUvelv CRSP ' • a °' iSzos ' a *.-' W.( 199 3 ) 

Ctata,,,^,, . <eeHyn, PB e'<./(I90 6 ,„. modified to 

synthesis 

N670.675. B - - * (.996, ^ °* 

•PPlicalions. Far e*,m , „ **' m< "«^ an be osed «. 

Pnn-ersforDNA^ nuc 'eases (Hymp B fioftn Wed c ™M«io„ 

^^r^^ M ^^^ptlr^; a!PK,braw 

fa another em K ^ } Pe "T- 

*** <* lipopjujj-g or ^ ft*, to «„ha„ ce ^ 
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molecules can be generated whi,h PNA-DNA chimeras of CRSP nucleic acid 

- DNA. Such c"2tr adVa "' a8e0US ProPe,,ieS ° f PNA 

<; ~i " cnimeras ="<»>'DNA recognition enzymes, (e.s RNAse H ,„rf r>w. 

5 polymerases), u, internet with (he DNA portion while L PNA L ^ 

high binding affinity and specificity PNA ™ T " °" W ° U ' d prOVi<le 

appmpriate iengfha seiecedTntl' o^Ta ^ """" link - " 
nucleobases, Z «t ^ ?"* ^ 

«™spo ^«.» S h„s, cat, receptees f„ „vo X ora6ents Mj " 

SW US «« , <SeC ' UBi ° 8er " <1989 > Atoi Acad 

Pf^P J ' Lema " re "'" Cl«87)Proe Atot Acad. Sci. USA 84-648 6« 

PCT Publ,ca„on No. W088/09810, published December 15 1 988) or d« h , , t 
barrier (see e a pptd li« ^ * iy5 <V or the blood-brain 

» adT, , '°" NO ' W089/10l34 'P» W ^«lApril25 1988) In 

> addition, oligonucleotides can be modified win, ktvr .■ . 

(See, e.g„ Krol « al 1 1 9881 *,„r T ™ M>"d.aat,on.tr,ggered cleavage agents 

e-8.,Zon ,98 l^^T^' 0 ^^^- 
fl . Isolated CRSP Proteins anH ay ) „„,., 

native ■S^^tt^ 7"*? iS0 ' a,ed ^ """^ 
.0 rmsc aoti-CRsZuC™ " T fra8mCTK "***■ «* use as immunogens 
c ami *-Kar antibodies. In one embodiment, native CRSP n m t»- u 
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5 Issue source tom which CRSp .iSSrta? °' 

'substantial,, fee of cellular material- includes ptepa^o^p ^ 
■be protein is separated ftom celluto compon J „ Z Z " *"* 

recombinant^ moduced I,,., ,. ™ S ° I " K <*"* fr °m which tl ,s isolated or 

more pmfembly | ess abou , 2Q% ™ " » • ^mmUmg pcotein"), 
man abou, 10% of „„ n . C R S P „,„, T Prole ' n ' S "" more '«s 
CRSP pcotein When .he CrIsp J" ""^ *- aboM 5% — 

embodiment, the language •subs, m ,i,,i„ «• , c synthesis of the protem. I n „ ne 
includes ptepanttions o, ^ * *" f Chemi,:al PreCWSOTS ° r *»**,. 

chemical ptecmsom o 2 CRSP b f ' T ' hm *°<* ™ 

Chemicn, preenrsors or ^ " - *" *" ™ 

Biologically active portions of a PR <u> r*^+ • • i „ 
antino acid _ ^ ~* 
sequence of th^ pdod * • aenved from the amino acid 

poly Pe ptidewhichi Sj forexaniple,10 25 50 lOOnL ? r ° temcanbea 
■ « . » 1 u > oo or more ammo acids in leneth 

In one embodiment, a biologically active portion of a CRSP ™, • 
leastacysteine-rich™;™ i ■ F"'uon or a CKbP protein compnses at 
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includes at least one cysteine-rich domain. In yet another embodiment, a biological* 
act.ve pomon of a CRSP protein comprises at least a signal sequence 

In an alternative embodiment, a biologically active portion of a CRSP protein 
compnses a CRSP amino acid sequence lacking a signal sequence 

nrot e - "* UnderSt0 ° d 3 PKfemd bi0l ° 8iCa,,y 3Ctive P° rti °n of a CRSP 

protem of the present mvention may contain at least one of the above-identified 

structural domains. A more preferred biologically active portion of a CRSP protein 
may contain at least two of the above-identified structural domains. An even more 
10 aTov^d ITT allyaCtiVeP ° rti0nof ^^PproteinmaycontainatleastZ ofthe 
T:^ZT d0 ^ nS - A ~^^biolo g icallyactiveponil; 

Moreover, other biologically active portions, in which other regions of the 
to more of the functional activities of a native CRSP protein 

in SEO ID NO ^ eml>0dimen, • ** CRSP *** » -»i"o acid sequence shown 

NC 2 I„ ° f am ' n ° ^ SeqUe " Ce M leaS ' ab ° M 55% *»*■«» «o SEQ ID 

20 SEQ,DNO: 5 . '""oU.erpreferrederohodiroent.uteCRSPproteinhaaanJre.acid 
equenee shot™ in SEQ ID NO:8 or an amino acid sequence a. leas, ahou, SS% 
hornoiogous ,„ SEQ IDNO:8. In another preferred embodiment, me CRSP protein has 
™ actd sequence shown in SEQ ,D NO: II or an amino acid sequence a. leas, 

* W P^t*™' OSEQ,DNO: "- — ~embodimen,a 

Zl ^ a, ^ " ' y """" "°' 45% - m ° re Preferabl >' <*»"• 50-55%. even 

L Z tZn™**" ami "° SeqUe,Kes *°™ » SEQ ID NO-2 
SEQIDNO:5,SEQIDNO:8,orSEQIDNO:ll 

30 NO-2 S^^r* CRSP ProBin " h ° m °'°S°- .° SEQ ID 

NO.2, SEQ ID NO:5, SEQ ID NO:8, or SEQ ID NO:l 1, and , preferably, retains the 

SEO 7*0 'n' y ^ ° f ,D ^ SEQ ' D N0:S ' NO:8, „ 

SEQ ID NO: 1 1 ye, data in amino acid sequence due ,o natural alldic variation or 
mmaganesis, « described in deh.il ,„ subsection I above. Accordingly, in another 

CRS f proKin r proKin which * - " — - 

leas, abou, 00% homologous to die amino aeid sequence of SEQ ID NO-2 SEO ID 
NO:5, SEQ ID NO:8, or SEQ ,D NO:, I and, preferebiy, retake ro^ronai 
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NO.5, SEQ ID NO* or SEQ ID NO- 1 ,1 ^ * §EQ ID »0:2 SEQ ID 

homologous to SEQ ID NO:2, SEQ^D N(fc5, SEQ ID NO^ ^ ab ° Ut ? *^ More 
To determine the percent homology ofnl '° rSEQlDN °' J1 - 

h "omology fa ("•. « »»ed he reu] ^ acid or 

^"■ogybe^n^^ a ^ n °« c ' d ° "^taoacid -idewi^. 

by the S e,„e„ce s (i e ' f°°f' S * fcnc, «>° «f the number of y 
x mm tu J ^nomoJo2ya#ofMpirtj« i wnucai positions 

25 M »' Bioi. 215:403-10 BLAST , MSI|>r0 « ram! »f Alfachul elal nJLw 

h0 ™'^ ■» CRSP nuctei<! acjd " 12 to nucieoiide 

<« b= Perfonned win, ^ XBLAST „ '" Ven,ioa BLA " p,„ J„ II 

acid ^ nces hom0 ^ L o t S ^7 D ■« - 50, WrtUagth , P 3 J^*- 

*scnbed in AlfacM « al <l 997m T ^ ' Gapped BLA ST can be wili J 
U,i,i ^BLAST„d Gapped bla ST ^^'^^^3^5^" 
~ (e. 6 .,XBLAST»d *" . le J^l 
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coming amino acid fences, a PAMI20 weigh, residue ^ . gap , 

ofl2, and a gap penalty of 4 can be used. V 

CRSP •H i " Vemi °° 3,50 Pr ° VideS CRSP Chimeri ° °' fuSi °" As ^ein a 

finked to a non-CRSP polypepnde. A CRSP polypeptide" refers to a polypeptide 
havtng an ammo acid sequence corresponding ,„ CRSP. whereas a "non-CRSP 

ZTofTct; t pr r n *■ crsp - — «- - - - 

portton of a CRSP prottm. In a preferred embodiment, a CRSP fusion protein 
compnses « leas, one biologically active portion of a CRSP protein In 
preyed -bodl— . a CRSP fusion protein comprises a, leas, ,w„ o^X ac,i»e 
porttons of, CRSP protein. ,„ ^ preferred ^ 

15 ^-'Hea^.taeebiologioally^vepo^ofaC^Ppro.em.MZl 
<u ton protem, me ,enn "operativcly linked" is intended ,„ Mil ^ fc ™ 

polype and me non-CRSP polypeptide are fused in-fiame to each other 1,. 
^y^de ' °" '° N "' em,i ° US " C - Krminus of "* CRSP 

20 For example, in one embodiment, the fusion protein is a OST-CRSP fusion 

roKtn m whtch ,h. CRSP sconces am fused ,„ the C-termmns of the GST ^Lm 
Such fuston protems can facilitate the purification of recombinant CRSP 

In another embodiment, the fusion profein is a CRSP protein conmining a 

25 « 7 T ^ a ' iK N - Km,i " US - FOT «""■* ,h = — <*SP signal 
Z (I. , about ammo acids 1 ,o 23 of SEQ ID NO:2) oa„ be removed and reload 

«lls), expresston and/or seomtion of CRSP can be increased through use of a 
heterologous signal sequence. 

In yet another embodiment, the fusion protein is , CRSP-immnnoglobuiin fusion 
30 protem ,„ whtch the CRSP seances comprising primarily the CRSP oys cinetch 

sr :rr M de,ived &om a ^ ^ ■ ^ ^ 

tauL, " ' ^ ^ ° f -*» me 
gene superfamify consisting of _ ^ 

surface glycoproKmfhsedto an immunoglobulin constant (Fc) region (see eg Capon 

CM-IgOl constructs); Unsley, P.S. «,«£ (1991)./ fip Man! 173:721-730 IaCD28 
IgG, construe and a B7-MgO, and Linslcy, P.S. ^JuXZi 
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«*» proteins of*. 1Um „, ^ 'T "" OTHi0 ° S - S °»« <Wvaiv« of eel I 
been t^de consisting JZ^ZS!T T ^ 
5 --«.o. ulfa oonL„ tff ^7^°^««^ K oep torfilsedl o an 
J- Mod. 337(3):141-I47; van "" * 

.» between a CRSP ligand £ XrCoToT » " 

fusion protemsoao bo used to affeolthahi^v •, Jr CRSP -' n ™arioglobuli„ 
Inhibition of the CRSP ,igi„dW„ 7 " °" CRSP C °* Mtt re -P- 
<* boatmen, of diflenenS^".^ * ^ for both 

Moreover, the C R SP- immun o 8 , ooulill ^ ^ f a^or cel, tate. 

nnntunogens ,„ prodllce Mli . CRS " !»««■» of the ,„ve„„o„ can be used as 

* assays ,o *** «• *-* and 

CRSPhgand. ores which inhibit the interaction of CRSP with a 

20 

conventional techniques, f or oxamnle hv , "o^ance with 

•o-mini for l^^^Zl^^ tended 
* »,i„ „ f ooheaive ends 1 p^ , TkloTT * ~" «* 
ondesitable joining, ^ Zo Zl ^' 'I ' n!a,m ™ to "«*> 

oan beayndteaized by tJSuC Z? fa a«,„„ embodiment, thefbsion gene 
Alternatively, PCR L^Z^T" — DN A ^theLra 

Primes which give rise ,„ comlleml " *" anchor 

30 ftagmena which can *J££ZZZT? '"° C °" S ~"' Ve 

^^oence^foreaC c^r^ 3 " <~e a chimeric > 

« * fohn Wiley & Sons: I9 £ " eda. A„s„be, 

available tha, already encode a LES^ ~ " ~* 

encoding nucleic aoid can be cloned i„,„ , ""'W*)- A CRSP- 

* -ie* is linjted in-W J " ~" "* *" <* 

The present invention also perbtins to variants of „„„„„ . 
««>ct,on aa either CRSP agonists (tnitnetics) o, aa CRsl^ '"°*"' S ^ 

s) or aa CRSP antagonists. Variants of the 
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CRSP protein. An antagonist of a TR^P r,r^- * i_ *i_ ■ 8 wnnora 

*— * « ~ o f rx ~ 

occurring form of the CRSP protein. naturally 

b one embodiment, variants of a CRSP protein which function as either CRSP 

15 „„„ • * mutants, e.g., truncation mutants, of a CRSP protein for CRSP protein 

'5 .gon,s,o,a„„ g „„ islactivlty . ] n one embodiment, a variegated library of CRSP van , 

varregated gene Irbnuy. A vanegated library of CRSP variants ean be produced bv L 

20 rndrvrdua, po, y pep, id e s , « a,^,,, . se , ^ ...JT* " 

d. play) contaming to sel of CRSP J> v Jy of m emoot 

In addition, libraries of fragments of a CRSP protein coding sequence can ha 
used to generare a variegated population of CRSP fragments for scroS Id 
subsequent faction of variants of a CRSP protein. ,„ one embodiment ,"bl of 

35 frogmen, of, CRSP coding sequence wirh a nucleaae under condition, wherom nickinc 
ocouraon y abou, per mo,acu,e, denamring Ure double «*^£ZST 
<"» ■» form double stranded DNA which can include serrsefcnrisens. pairs from 
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CRSP proteins. The most wiH.lv a u combinatorial mutagenesis of 

into replicable expression vectors tr^fT C, ° mn8 the gene librai ? 

combination with th P • "wanes, can be used in 

A «) ^^r* 10 f;, ,ify crsp v>rian,s (Arkin - Yci ™ 

CRSP library For e ™ l 7 K T" " eXP '° i,ed 10 * 

— h-d ce, ls are men c^ted wilh „ e „^ ^ ^ 

mutant on siting b y ^ , i8and „ ^ ^ * ^ d £ *™™° «f *■ 
immune cell responses PlasmidnMa .c c measurmg any of a number of 

for inhibition, JJ^T^r rT ""^ *™ ** °°" S WhWl — 
25 further chamcterized ^ " """"^ Md "* mdiyidtta. clones 

POlyclon, an. monodona, -^C^SSSST h 
used or, alternatively the inv P nti rtn -J proteln 030 be 

30 use aa ^ SSKT T~ ^ * 
residues of the amino acid , J.. p y"* ae ot CRS P comprises at least 8 amino acid 

raised against the pernio T- " eP " OPe " CRSP SUCh *" " «««»■, 

5 amino acid residues, „ ^ ZZ'ZZ T 

preferably at te, 30 amino acid reaiduea m ° s < 
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Preferred epitopes encompassed by the antigenic peptide are regions of CRSP 
that are located on the surface of the protein, e.g., hydrophilic regions. 

A CRSP immunogen typically is used to prepare antibodies by immunizing a 
smtable subject, (e.g., rabbit, goat, mouse or other mammal) with the immunogen An 
appropriate immunogenic preparation can contain, for example, recombinant^ 
expressed CRSP protein or a chemically synthesized CRSP polypeptide The 
preparation can further include an adjuvant, such as Freund's complete or incomplete 
adjuvant, or similar immunostimulatory agent. Immunization of a suitable subject with 
an immunogenic CRSP preparation induces a polyclonal anti-CRSP antibody response 
Accordmgly, another aspect of the invention pertains to anti-CRSP antibodies ' 
The term "antibody" as used herein refers to immunoglobulin molecules and 
.mmunologically active portions of immunoglobulin molecules, i.e., molecules that 

CR^Ex bind , ing f ^ SPedflCa,ly ^ (immUn ° reaC tS «*> - «*■«. 
such as CRSP. Examples of .mmunologically active portions of immunoglobulin 

15 molecu.es mclude F(ab) and F(ab') 2 fragments which can be generated by treating the 
anybody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind CRSP. The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of 
20 immunoreacting with a particular epitope of CRSP. A monoclonal antibody 

composition thus typically displays a single binding affinity for a particular CRSP 
protein with which it immunoreacts. 

Polyclonal anti-CRSP antibodies can be prepared as described above by 
unmumzing a suitable subject with a CRSP immunogen. The anti-CRSP antibody titer 
m the immunized subject can be monitored oyer t . me by standar(j techni£}ues> ^ ^ 

with an enzyme linked immunosorbent assay (ELISA) using immobilized CRSP If 
desired, the antibody molecules directed against CRSP can be isolated from the mammal 
(e.g., from the blood) and further purified by well known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an appropriate time after immunization 
e^g when the anti-CRSP antibody titers are highest, antibody-producing cells can be 
obtained from the subject and used to prepare monoclonal antibodies by standard 
techniques, such as the hybridoma technique originally described by Kohler and 

niToT? Na ' Ure 256:495 - 497) ^ a,S °' Br0W " 6t aL < 1981 ^ ^munol. 
6 ^ti 5 rr aL (1980) 1 Bi ° L Chem 255:498 °^ Yeh et al. (1976) PNAS 

ce.i I I ''I : ^ ° 982) ^ ^ CanC6r 29:269 ' 75) ' * e more human B 

cell hybndoma technique (Kozbor et al. (1 983) Immunol Today 4:72) the EBV- 

hybndoma technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy 
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Alan R Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing 
monoclonal antibody hybridomas is well known (see generally R. H. Kenneth in 
^nocW/^ ^ ^ w //? ^ /co/ ^ a/ ^ pjenum ^ 

Corp.,NewYork,NewYork(I980);E.A.Lerner(1981)ra/ey.5/ o / Med. 
5 54:387-402; M.L.Gefteretal. (1977) Somatic Cell Genet. 3:231-36) Briefly an 

immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) 
from a mammal immunized with a CRSP immunogen as described above and the 
culture supernatants of the resulting hybridoma cells are screened to identify a 
hybndoma producing a monoclonal antibody that binds CRSP. 
10 Any of the many well known protocols used for fusing lymphocytes and 

immortalized cell lines can be applied for the purpose of generating an anti-CRSP 
monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:55052; Gefter et al 
Somatic Cell Gene,, cited supra; Lerner, Yale J. Biol. Med., cited supra; Kenneth 
Monoclonal Antibodies, cited supra). Moreover, the ordinarily skilled worker will 
15 appreciate that there are many variations of such methods which also would be useful 
Typically, the immortal cell line (e.g., a myeloma cell line) is derived from the same ' 
mammalaan species as the lymphocytes. For example, murine hybridomas can be made 
by fusing lymphocytes from a mouse immunized with an immunogenic preparation of 
die present invention with an immortalized mouse cell line. Preferred immortal cell 
lines are mouse myeloma cell lines that are sensitive to culture medium containing 
hypoxanthme, aminopterin and thymidine ("HAT medium"). Any of a number of 
myeloma cell lines can be used as a fusion partner according to standard techniques 
e.g the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or S P 2/0-A g 14 myeloma lines. These ' 
myeloma lines are available from ATCC. Typically, HAT-sensitive mouse myeloma 
ce s are fused to mouse splenocytes using polyethylene glycol ("PEG"). Hybridoma 
cells resulting from the fusion are then selected using HAT medium, which kills unfused 
and unproductively fused myeloma cells (unfused splenocytes die after several days 
because they are not transformed). Hybridoma cells producing a monoclonal antibody 
of the invention are detected by screening the hybridoma culture supernatants for 
30 anybodies that bind CRSP, e.g., using a standard ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas a 
monoclonal anti-CRSP antibody can be identified and isolated by screening a 
recombinant combinatorial immunoglobulin library (e.g., an antibody phage display 
hbrary) with CRSP to thereby isolate immunoglobulin library members that bind CRSP 
Kits for generating and screening phage display libraries are commercially available 
e.g the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01 ■ and 
theStmtagene*^^ Additionally ' 
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examples of methods and reagents particularly amenable for use in generate and 

= t srr ,ibr ? can be found - for - * ~ 

patent No. 5,223,409; Kang et al. PCT Internationa] Publication No WO 92/1 86 1 9- 
D - e -tal.PCTInternationa,PublicationNo.W09,/17^ 
5 International Publication WO 92/20791 ; Markland et al PCT Intl ouJ- 

No. WO 92/1 5679- r w ;,i- . , n „J International Publication 

M r " ' Bre,t,ing 61 91 PCT international Publication WO 93/01288- 

McCafferty et al. PCT International Publication No WO92/01047 P , » 

International Publication No. WO 92/09690; Ladner PCT 1 7 , * ^ 
No. WO 90/02809; Fuchs et al fl99n bJtT7 «' lDte ™ tlon!l1 Plication 

10 Hum. AM i^J^^^T^T^ ^ Ct * (, " 2) 
, OJ ' nuseetai. (l 989) Science 246 1 275- 1781 • 

Griffiths et al. (1993) EMBO J 17-77^ 734. u ■■ ^-^/M^l, 

SO*,-. ^ / 34;Hawklnsetal - (1992) y.A/o/ 5,o/ 226-880 

896; Clarkson et al. (1 991 ) AW* 352-674 /;->». , -"6.889- 

„ v y J02& '' 4 " 628 ; Gram eta. (1992) pa/4 C8o-7<-7/c 

3580; Garrad et al. (1991) Bio/Technology 9 1373-1377- Hon* T f 
^ctfto 4n7 . q . , ,37 AHoogenboometal.(1991)A/ W c. 

iy.4133-4137; Barbas eta. (1991) PM4988-7Q7B -7oo-> ... 
15 al.AWe (1990) 348:552-554. y ^™^-7978-7982; and McCafferty et 

Additionally, recombinant anti-CRSP antibodies, such as chimeric and 
Which cm be made usmg sandaK j reccMnbinant DNA technioues are within 1. . 

Patent Application 1 84 1 87- T a n.„ u- r. °°'"^oy, Akira, et al. European 

PP canon 184,187, Tamguchi, M., European Patent Application 171 496- 

. teru.b.ment 5,225,539; Jones et al. (1986) Nature 321 -552-525- 

hv « H A °."° U f RS? "°' i ''° dy <e ' 8 -' mo "°"™ al «»ody) ean be used to isolate CRSP 

Ml. CRSP ambody can facdttate the purification of natural CRSP frem cells and of 
recombinantly produced CRSP expressed in host cells. Moreover m " « 
amtbody can be used ,„ derec, CRSP protein (e.g.. i„ , ^ 
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supernatant) in order to evaluate the abundance and pattern of expression of the CRSP 

t-ssue as part of a cluneal testing procedure, e.g., to, for example, determine the efficacy 
of a gl ven treatment regime, Detection can be facilitated by coupling (i.e., 
nkmg) the ant.body to a detectable substance. Examples of detect UtanT 
mc ude various enzymes, prosthetic groups, fluorescent materials, luminescent 
matenals, biolum mescent materials, and radioactive material, Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 
acety lcholmesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelhferone, fluorescein, fluorescein isothiocyanate, rhodamine 
d.chlorotriazinylamine fluorescein, dansy. chloride or phycoerythrin; an example of a 

t aequorin ' md examples of suitab,e radioactive material inc,ude 

«> i, o or H. 
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30 



HI. Recombinant Exnre^inn V ectors anH Un,t gells 

Another aspect of the invention pertains to vectors, preferably expression 

20 T " 0ntami " 8 3 nUC,CiC enC ° ding CRSP < or a hereof). As used herein 

20 the term "vector" refers to a nucleic acid molecule capable of transporting another 

tTa 7 7 k ^ bee " ^ ^ ° f VeCt ° r iS * ~> which refers 

o a circular ouble stranded DNA loop into which additional DNA segments can b 

hg ted. Another type of vector is a vira, vector, wherein additional DNA segments can 

be hgated into the viral genome. Certain vectors are capable of autonomous rep^Z 

ma host cell mto which they are introduced (e.g., bacterial vectors having a bacteria 

ongm of replication and episoma. mammalian vectors). Other vectors (e g non- 
episomal mammalian vectors) are integrated into the genome of a host cell "upon 
mtroduction into the host cel., and thereby are replicated along with the host genome 

Moreover certam vectors are capable of directing the expression of genes to whi h L 
are o p tlve]y linked . Such ^ ^ ^ ^ ^ ^ ^ ^ Jhd. they 

fZ^ T Pr T n T VCCt0rS ° f Uti ' ity " reC ° mbinant ° NA teChni ^ es - oft- in the 
form of plasrmds. In the present specification, "plasmid" and "vector" can be used 

•nterchangeably as the plasmid is the most commonly used form of vector. However 

^ 0thCr f ° mS ° f CXpreSSi0n «* as vi'ral 

vectors (e.g replication defective retroviruses, adenoviruses and adeno-associated 
viruses), which serve equivalent functions. 
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The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 
5 operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
10 sequence" is intended to includes promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described for 
example, m Goeddel; Gene Expression Technology: Methods in Enzymology 185 ' 
Academic Press, San Diego, CA (1 990). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cell and 
15 those which direct expression of the nucleotide sequence only in certain host cells (e g 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art ' 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of protein desired, etc The 
expression vectors of the invention can be introduced into host cells to thereby produce 
20 proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
descnbed herein (e.g., CRSP proteins, mutant forms of CRSP, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 
expression of CRSP in prokaryotic or eukaryotic cells. For example, CRSP can be 
expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression 
25 vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in 
Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press 
San Diego, CA (1990). Alternatively, the recombinant expression vector can be 
transcribed and translated /„ vitro, for example using T7 promoter regulatory sequences 
and T7 polymerase. 

30 Expression of proteins in prokaryotes is most often carried out in E coli with 

vectors containing constitutive or inducible promotors directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 

35 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
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moiety and the recombinant protein to enable separation of the recombinant protein from 
he fus.on m 01 ety subsequent to purification of the fusion protein. Such enzymes, and 
the.r cognate recognition sequences, include Factor Xa, thrombin and enterokinase 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D B 

and pRITS (Pharmaca, P.scataway, NJ) which fuse glutathione S-transferase (GST) 
maltose E binding protein, or protein A, respectively, to the target recombinant protein 

Punfied fusion proteins can be utilized in CRSP activity assays, in CRSP ligand 
bmdrng (e. g ., ^ assays or emape66w ^szys described in detail below), to generate 
specie for CRSP proteins, as example, In a preferred embodiment a CRSP 

to nf T m ' retr ° Viral eXPreSSi ° n VCCt0r ° f ^ PrCSent »"»*■ - be utilized 
to mfect bone marrow cells which are subsequently transplanted into irradiated 

recrprents The pathology of the subject recipient is then examined after sufficient time 
has passed (e.g six (6) weeks). 

15 Examples of suitable inducible non-fusion £. coli expression vectors include 

pTrc (Amann et al., (1988) Gene 69:301-315) and pET 1 Id (Studier et al., Gene 
Expression Technology: Methods in Emymology 185, Academic Press, San Diego 
CaMorma (1 990) 60-89, Target gene expression from the pTrc vector relies on host 

P 0l y merase transcription from a hybrid trp-lac fusion promoter. Target gene 
expression from the pET 1 Id vector relies on transcription from a T7 gn.O lac fusion 
promoter mediated by a coexpressed viral RNA polymerase (T7 gnl) This viral 
polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in K coli is to express 
the protein m a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in 
Emymology 185, Academic Press, San Diego, California (1990) 1 19-128) Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
express^ vector so that the individual codons for each amino acid are those 
preferentially utilized in E coli (Wada et al., (1992) Nucleic Acids Res. 20:21 11-2118) 
Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. 

Examn, 1 " ^ is 3 »~ ^ssion vector 

Examples of vectors for expression in yeast S. cerivisae include pYepSecl (Baldari et 
al., (1987) Emho J. 6:229-234), pMFa (Kurjan and Herskowitz, (1 982) Cell K 
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Zv^r^T 61 (,98?) GCne 54:1 13 - ,23) ' PYES2 Corporation, 
San D,ego, CA), and picZ (InVitrogen Corp, San Diego, CA). 

Alternatively, CRSP can be expressed in insect cells using baculovirus 
expression vectors. Baculovirus vectors available for expression of proteins in cultured 

T^ C ! (e ' 8 " ^ 9 CCl,S) indUde PAC SCrieS (Smith Ct aL ( ,983 >^'- Cell Biol 
3:2156-2165) and the pVL series (Lucklow and Summers (1989) Wra/ogy 170 31-39) 

In yet another embodiment, a nucleic acid of the invention is expressed in 

mammahan cells using a mammalian expression vector. Examples of mammalian 

expression vectors include pCDMS (Seed, B. (1 987) Nature 329:840) and pMT2PC 

10 (Kaufman et al. (1987) EMBOJ. 6:187-195). When used in mammalian cells, the 
express™ vector's control functions are often provided by viral regulatory elements 
For example, commonly used promoters are derived from polyoma, Adenovirus 2 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J. Fritsh E F 

15 and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold Spring " 
Harbor Laboratory. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
20 (e.g. tissue-specific regulatory elements are used to express the nucleic acid) Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-spec.nc promoters include the albumin promoter (liver-specific; Pinkert et al 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 

?^T1 ( T } EMBOJ 8:?29 - 733) ^ immuno ^ulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) PNAS 86 5473-5477) 
pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,3 16 and 
30 European Application Publication No. 264,166). Developmentally-regulated promoters 
are also encompassed, for example the murine hox promoters (Kessel and Gruss (1990) 
Saence 249:374-379) and the a-feto P rotein promoter (Campes and Tilghman (1989) 
Genes Dev. 3:537-546). 

« nxu ^ inVenti ° n &rther Pr ° VideS ' recombina "< expression vector comprising a 
35 DNA molecule of the invention cloned into the expression vector in an antisense 

orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 
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- — «: "rjir rcrr •- * chose " *~ 

instance viral p romoters ™ ™ A m ° le "" c a of cell types, for 

5 direct -—^TZL^rT' M reeU ' a ' 0ry SeqUMCeS CM * 

.0 expression osin, ^^T^TT* 0 ''!" reeU ' ati ° n ^ 

of such . cel, B~r„ JM 10 *" ^ °' >° K ""* P^eny 

0 n ro „- A "t S,Ce " '""'o^ofeoreukatyotic cell. For example CRSP 

protetn can be expressed in bacterial cell, such aa E call i„«e. I ,, 
tnantnahan ce„s (anch as Chinese hamster o^Z^J£T?" t 
anitahle hoar cells are known to rhoae skilled iTL „ ^ 

For stable transfection of mammalian cells it is known that H T 
^^lo^ehoatce^Z^ro^^r^ 
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selectable markers include those which confer resistance to drugs, such as G41 8, 
hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that encoding CRSP or can be 
introduced on a separate vector. Cells stably transfected with the introduced nucleic 
acid can be identified by drug selection (e.g., cells that have incorporated the selectable 
marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) CRSP protein. Accordingly, the invention 
further provides methods for producing CRSP protein using the host cells of the 
invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding CRSP has been 
introduced) in a suitable medium such that CRSP protein is produced. In another 
embodiment, the method further comprises isolating CRSP from the medium or the host 
cell. 

The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which CRSP-coding sequences have been 
introduced. Such host cells can then be used to create non-human transgenic animals in 
which exogenous CRSP sequences have been introduced into their genome or 
homologous recombinant animals in which endogenous CRSP sequences have been 
altered. Such animals are useful for studying the function and/or activity of CRSP and 
for identifying and/or evaluating modulators of CRSP activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA which is 
integrated into the genome of a cell from which a transgenic animal develops and which 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous CRSP gene has been 
altered by homologous recombination between the endogenous gene and an exogenous 
DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the 
animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing CRSP- 
ericoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
microinjection, retroviral infection, and allowing the oocyte to develop in a 
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pseudopregnant female foster animal. The human CRSP cDNA sequence of SEQ ID 
NO: 1, SEQ ID NO:4, SEQ ID NO:7, or SEQ ID NO: 10 can be introduced as a transgene 
into the genome of a non-human animal. Alternatively, a nonhuman homologue of a 
human CRSP gene, such as a mouse CRSP gene, can be isolated based on hybridization 
to the human CRSP cDNA (described further in subsection I above) and used as a 
transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific 
regulatory sequence(s) can be operably linked to the CRSP transgene to direct 
expression of CRSP protein to particular cells. Methods for generating transgenic 
animals via embryo manipulation and microinjection, particularly animals such as mice, 
have become conventional in the art and are described, for example, in U.S. Patent Nos. 
4,736,866 and 4,870,009, both by Leder et al, U.S. Patent No. 4,873,191 by Wagner et 
al. and in Hogan, B., Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y., 1986). Similar methods are used for production of 
other transgenic animals. A transgenic founder animal can be identified based upon the 
presence of the CRSP transgene in its genome and/or expression of CRSP mRNA in 
tissues or cells of the animals. A transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, transgenic animals carrying a 
transgene encoding CRSP can further be bred to other transgenic animals carrying other 
transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of a CRSP gene into which a deletion, addition or substitution has been 
introduced to thereby alter, e.g., functionally disrupt, the CRSP gene. The CRSP gene 
can be a human gene (e.g., the cDNA of SEQ ID NO:3, SEQ ID NO: 6, SEQ ID NO:9 
or SEQ ID NO: 12), but more preferably, is a non-human homologue of a human CRSP 
gene. For example, a mouse CRSP gene of SEQ ID NO:16 can be used to construct a 
homologous recombination vector suitable for altering an endogenous CRSP gene in the 
mouse genome. In a preferred embodiment, the vector is designed such that, upon 
homologous recombination, the endogenous CRSP gene is functionally disrupted (i.e., 
no longer encodes a functional protein; also referred to as a "knock out" vector). 
Alternatively, the vector can be designed such that, upon homologous recombination, 
the endogenous CRSP gene is mutated or otherwise altered but still encodes functional 
protein (e.g., the upstream regulatory region can be altered to thereby alter the 
expression of the endogenous CRSP protein). In the homologous recombination vector, 
the altered portion of the CRSP gene is flanked at its 5' and 3' ends by additional nucleic 
acid of the CRSP gene to allow for homologous recombination to occur between the 
exogenous CRSP gene carried by the vector and an endogenous CRSP gene in an 
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embryonic stem cell. The additional flanking CRSP nucleic acid is of sufficient length 
for successful homologous recombination with the endogenous gene. Typically, several 
kilobases of flanking DNA (both at the 5' and 3' ends) are included in the vector (see 
e.g., Thomas, K.R. and Capecchi, M. R. (1987) Cell 51 :503 for a description of 
5 homologous recombination vectors). The vector is introduced into an embryonic stem 
cell line (e.g., by electroporation) and cells in which the introduced CRSP gene has 
homologously recombined with the endogenous CRSP gene are selected (see e.g., Li, E. 
et al. (1992) Cell 69:915). The selected cells are then injected into a blastocyst of an 
animal (e.g., a mouse) to form aggregation chimeras (see e.g., Bradley, A. in 
1 0 Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E J. Robertson, 
ed. (IRL, Oxford, 1987) pp. 1 13.152). A chimeric embryo can then be implanted into a 
suitable pseudopregnant female foster animal and the embryo brought to term. Progeny 
harboring the homologously recombined DNA in their germ cells can be used to breed 
animals in which all cells of the animal contain the homologously recombined DNA by 
15 germline transmission of the transgene. Methods for constructing homologous 

recombination vectors and homologous recombinant animals are described further in 
Bradley, A. (1991) Current Opinion in Biotechnology 2:823-829 and in PCT 
International Publication Nos.: WO 90/1 1354 by Le Mouellec et al.; WO 91/01 140 by 
Smithies et al.; WO 92/0968 by Zijlstra et al.; and WO 93/041 69 by Bems et al. 
20 In another embodiment, transgenic non-humans animals can be produced which 

contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI . For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) PNAS 
89:6232-6236. Another example of a recombinase system is the FLP recombinase 
25 system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251 :1351-1355. If 
a cre/loxP recombinase system is used to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase and a selected protein are 
required. Such animals can be provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one containing a transgene encoding a 
30 selected protein and the other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut, I. et al. (1997) Nature 385:810- 
813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
induced to exit the growth cycle and enter G 0 phase. The quiescent cell can then be 
35 fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 
of the same species from which the quiescent cell is isolated. The recontructed oocyte is 
then cultured such that it develops to morula or blastocyte and then transferred to 
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pseudopregnant female foster animal. The offspring borne of this female foster animal 
will be a clone of the animal from which the cell, e.g., the somatic cell, is isolated. 

IV. Pharmace utical Cnmp ngif »r.c 

The CRSP nucleic acid molecules, CRSP proteins, and anti-CRSP antibodies 
(also referred to herein as "active compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. Such compositions typically 
compnse the nucleic acid molecule, protein, or antibody and a pharmaceutical^ 
acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" is 
intended to include any and all solvents, dispersion media, coatings, antibacterial and 
antifungal agents, isotonic and absorption delaying agents, and the like, compatible with 
Pharmaceutical administration. The use of such media and agents for pharmaceutically 
active substances is we,, known in the art. Except insofar as any conventiona, media or 
agent is incompatible with the active compound, use thereof in the compositions is 
15 contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation) 
transdermal (topica,), transmural, and rectal administration. Solutions or suspensions 
used for parenteral, intraderma,, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents- 
antibacterial agents such as benzy, alcoho, or methyl parabens; antioxidants such as 
ascorine acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicty such as sodium chloride or dextrose. pH can be adjusted with acids or bases 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be ' 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
30 plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS) In 
all cases, the composition must be sterile and should be fluid to the extent that easy ' 
synngability exists. It must be stable under the conditions of manufacture and storage 
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and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
5 can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
1 0 isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 

chloride in the composition. Prolonged absorption of the injectable compositions can be 
brought about by including in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a CRSP protein or anti-CRSP antibody) in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 
incorporating the active compound into a sterile vehicle which contains a basic 
dispersion medium and the required other ingredients from those enumerated above. In 
20 the case of sterile powders for the preparation of sterile injectable solutions, the 

preferred methods of preparation are vacuum drying and freeze-drying which yields a 
powder of the active ingredient plus any additional desired ingredient from a previously 
sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically 
compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and the like can contain any of the 
following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a lubricant 
35 such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 
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For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmural or transdermal means For 
5 transmural or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art 
and mclude, for example, for transmural administration, detergents, bile salts, and ' 
fusidic acid derivatives. Transmural administration can be accomplished through the 
use ofnasal sprays or suppositories. For transdermal administration, the active 
10 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e g with 
conventual suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

15 In one embodiment, the active compounds are prepared with carriers that will 

protectee compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid 
20 Methods for preparation of such formulations will be apparent to those skilled in the art 
The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells w,th monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
acceptable earners. These can be prepared according to methods known to those skilled 
.5 in the art, for example, as described in U.S. Patent No. 4,522,8 1 1 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herem refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
0 calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

Toxicity and therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e g. for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
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therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio 
LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While 
compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially 
from cell culture assays. A dose may be formulated in animal models to achieve a 
circulating plasma concentration range that includes the IC50 (i.e., the concentration of 
1 5 the test compound which achieves a half-maximal inhibition of symptoms) as 

determined in cell culture. Such information can be used to more accurately determine 
useful doses in humans. Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
20 as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by 
stereotactic injection (see e.g., Chen etal. (1994) /W/1S 91:3054-3057). The 
pharmaceutical preparation of the gene therapy vector can include the gene therapy 
vector in an acceptable diluent, or can comprise a slow release matrix in which the gene 
25 delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g., retroviral vectors, the 
pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
30 dispenser together with instructions for administration. 

V. Uses and Methods of the Invention 

The molecules of the present invention (e.g., nucleic acid molecules, proteins, 
protein homologues, and antibodies described herein) can be used in one or more of die 
35 following methods: a) screening assays; b) predictive medicine (e.g., diagnostic assays, 
prognostic assays, monitoring clinical trials, and pharmacogenetics); and c) methods of 
treatment (e.g., therapeutic and prophylactic). As described herein, a CRSP protein of 
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the invention has one or more of the following activities: intracellular calcium an 
mcrease ,n phosphatidylinositol or other molecule, and can result e g in ' 
Phosphorylation of specific proteins, a modulation of gene transcription and any of the 
other biological activities set forth herein. 
5 In a preferred embodiment, a CRSP activity is at least one or more of the 

fo. lowmg activities: (i) interaction of a CRSP protein with and/or binding to a second 
molecule, (e.g., a protein, such as a CRSP (e.g., CRSP-1) receptor, a soluble form of a 

T CePt ° r ' mCmber ° fthe ^ famUy ° f ***** P r °<eins, or a 
non-CRSP signaling molecule); (ii) interaction of a CRSP protein with an intracellular 

soluble CRSP protein and a second soluble CRSP binding partner (e.g., a non-CRSP 
protein molecule or a second CRSP protein molecule); (iv) interaction with other 
extracellular proteins (e.g., regulation of ^/-dependent cellular adhesion to extracellular 
matrix components); (v) binding to and eliminating an undesirable molecule (e g a 
1 5 de ox,fv inactivity or defense function); and/or (vi) an enzymatic activity, andtn can 
thus be used m, for example, (1) modulation of cellular signal transduction, either in 
vuro arm vivo (e.g., antagonism of the activity of members of the Wnt family of 
secreted proteins or supression of ^-dependent signal transduction); (2) regulation of 
-—ion Ween cells (e.g., regulation of ^dependent cell-cell mictions) 

CRSP e.g., CRSP-1) to a receptor; (4) regulation of gene transcription in a ceH involved 
» development or differentiation, either in vitro or in vivo (e.g., induction of celZ 
differentiation); (5) regulation of gene transcription in a cell involved in development or 
differentiation, wherein at least one gene encodes a differentiation-specific protein- (6) 
regulation of gene transcription in a cell involved in development or differentaition 
wherein at least one gene encodes a second secreted protein; (7) regulation of gene ' 
transcription in a cell involved in development or differentiation, wherein at least one 
gene encodes a signal transduction molecule; (8) regulation of cellular proliferation 
either ,n vuro or in vivo (e.g., induction of cellular proliferation or inhibition of ' 
proliferation as in the case of supression of tumorogenesis (e.g., supression of 
glioblastoma formation)); (9) formation and maintenance of ordered spatial 
arrangements of differentiated tissues in vertebrates, both adult and embryonic (e g 
induction of head formation during vertebrate development or maintenance of 
hematopoietic progenitor cells); (10) modulation of cell death, such as stimulation of 
cell survival; (, ,) regulating cell migration; and/or (12) immune modulation 

Accordingly one embodiment of the present invention involves a method of use 
(e.g., a diagnostic assay, prognostic assay, or a prophylactic/therapeutic method of 
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treatment) wherein a molecule of the present invention (e.g., a CRSP protein, CRSP 
nucleic acid, or a CRSP modulator) is used, for example, to diagnose, prognose and/or 
treat a disease and/or condition in which any of the aforementioned activities (i.e., 
activities (i) - (vi) and (1) - (12) in the above paragraph) is indicated. In another ' 
5 embodiment, the present invention involves a method of use (e.g., a diagnostic assay, 
prognostic assay, or a prophylactic/therapeutic method of treatment) wherein a molecule 
of the present invention (e.g., a CRSP protein, CRSP nucleic acid, or a CRSP 
modulator) is used, for example, for the diagnosis, prognosis, and/or treatment of 
subjects, preferably a human subject, in which any of the aforementioned activities is 
1 0 pathologically perturbed. In a preferred embodiment, the methods of use (e.g., 

diagnostic assays, prognostic assays, or prophylactic/therapeutic methods of treatment) 
involve administering to a subject, preferably a human subject, a molecule of the present 
invention (e.g., a CRSP protein, CRSP nucleic acid, or a CRSP modulator) for the 
diagnosis, prognosis, and/or therapeutic treatment. In another embodiment, the methods 
1 5 of use (e.g., diagnostic assays, prognostic assays, or prophylactic/therapeutic methods of 
treatment) involve administering to a human subject a molecule of the present invention 
(e.g., a CRSP protein, CRSP nucleic acid, or a CRSP modulator). 

Other embodiments of the invention pertain to the use of isolated nucleic acid 
molecules of the invention can be used, for example, to express CRSP protein (e.g., via a 
recombinant expression vector in a host cell in gene therapy applications), to detect 
CRSP mRNA (e.g., in a biological sample) or a genetic alteration in a CRSP gene, and 
to modulate CRSP activity, as described further below. In addition, the CRSP proteins 
can be used to screen drugs or compounds which modulate the CRSP activity as well as 
to treat disorders characterized by insufficient or excessive production of CRSP protein 
or production of CRSP protein forms which have decreased or aberrant activity 
compared to CRSP wild type protein (e.g., developmental disorders or proliferative 
diseases such as cancer as well as diseases, conditions or disorders characterized by 
abnormal cell differentiation and/or survival, an abnormal extracellular structure, or an 
abnormality in a defense mechanism). Moreover, the anti-CRSP antibodies of the 
invention can be used to detect and isolate CRSP proteins, regulate the bioavailability of 
CRSP proteins, and modulate CRSP activity. The term "an aberrant activity", as applied 
to an activity of a protein such as CRSP (e.g., CRSP-1), refers to an activity which 
differs from the activity of the wild-type or native protein or which differs from the 
activity of the protein in a healthy subject. An activity of a protein can be aberrant 
because it is stronger than the activity of its native counterpart. Alternatively, an activity 
can be aberrant because it is weaker or absent related to the activity of its native 
counterpart. An aberrant activity can also be a change in an activity. For example an 
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aberrant protein can interact with a different protein relative to its native counterpart. A 
cell can have an aberrant CRSP (e.g., CRSP-1) activity due to overexpression or 
underexpression of the gene encoding CRSP. 



A. Screening Assays : 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to CRSP proteins or have 
a stimulatory or inhibitory effect on, for example, CRSP expression or CRSP activity. 
Modulators can include, for example, CRSP agonist and/or CRSP antagonists. The term 
"agonist", as used herein, is meant to refer to an agent that mimics or upregulates (e.g. 
potentiates or supplements) a CRSP (e.g., CRSP-1) bioactivity. A CRSP agonist can be 
a compound which mimics a bioactivity of a CRSP protein, such as transduction of a 
signal from a CRSP receptor, by, e.g., interacting with a CRSP-1 receptor. A CRSP 
agonist can also be a compound that upregulates expression of a CRSP gene. A CRSP 
agonist can also be a compound which modulates the expression or activity of a protein 
which is located downstream of a CRSP receptor, thereby mimicking or enhancing the 
effect of binding of CRSP to a CRSP receptor. 

"Antagonist" as used herein is meant to refer to an agent that inhibits, decreases 
or suppresses a CRSP (e.g., CRSP-1) bioactivity. An antagonist can be a compound 
which decreases signalling from a CRSP protein, e.g., a compound that is capable of 
binding to CRSP-1 or to a CRSP-1 receptor. A preferred CRSP antagonist inhibits the 
interaction between a CRSP protein and another molecule, such as a CRSP receptor. 
Alternatively, a CRSP antagonist can be a compound that downregulates expression of a 
CRSP gene. A CRSP antagonist can also be a compound which modulates the 
expression or activity of a protein which is located downstream of a CRSP receptor, 
thereby antagonizing the effect of binding of CRSP to a CRSP receptor. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of a CRSP protein or polypeptide or 
biologically active portion thereof. In another embodiment, the invention provides 
assays for screening candidate or test compounds which bind to or modulate the activity 
of a CRSP receptor. The test compounds of the present invention can be obtained using 
any of the numerous approaches in combinatorial library methods known in the art, 
including: biological libraries; spatially addressable parallel solid phase or solution 
phase libraries; synthetic library methods requiring deconvolution; the 'one-bead one- 
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compound' library method; and synthetic library methods using affinity chromatography 
selection. The biological library approach is limited to peptide libraries, while the other 
four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12: 145). 

5 Examples of methods for the synthesis of molecular libraries can be found in the 

art, for example in: DeWitt et al. (1993) Proc. Natl. Acad Sci. U.S.A. 90:6909; Erb et 
al. (1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994). J. Med. 
Chem. 37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. 
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in 
10 Gallop etal. (1994)./ Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
'409), plasmids (Cull etal. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage 
15 (Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406); 
(Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol. 
222:301-310); (Ladner supra.). 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a CRSP receptor on the cell surface is contacted with a test compound and the 
ability of the test compound to bind to a CRSP receptor determined. The cell, for 
example, can be of mammalian origin or a yeast cell. Determining the ability of the test 
compound to bind to a CRSP receptor can be accomplished, for example, by coupling 
the test compound with a radioisotope or enzymatic label such that binding of the test 
compound to the CRSP receptor can be determined by detecting the labeled compound 
in a complex. For example, test compounds can be labeled with 125 I, 14 C, or 3 H, 
either directly or indirectly, and the radioisotope detected by direct counting of 
radioemmission or by scintillation counting. Alternatively, test compounds can be 
enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, 
or luciferase, and the enzymatic label detected by determination of conversion of an 
appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a test 
compound to interact with a CRSP receptor without the labeling of any of the 
interactants. For example, a microphysiometer can be used to detect the interaction of a 
test compound with a CRSP receptor without the labeling of either the test compound or 
the receptor. McConnell, H. M. et al. (1992) Science 257:1906-1912. As used herein, a 
"microphysiometer" (e.g., Cytosensor™) is an analytical instrument that measures the 
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the test compound to interact with a prcp 
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focused primariJy on the effect of the drug on the molecular target as may be manifest in 
an alteration of binding affinity with upstream or downstream element, AccordTn „ 
an exemplary screening assay of the present invention, the compound of interest is 
contacted with a CRSP (e.g., CRSP-1) protein or a CRSP (e.g., CRSP-1) binding 
partner e.g., a receptor. The receptor can be soluble or the receptor can be present on a 
ceH surface. To the mixture of the compound and the CRSP protein or CRSP binding 
partner ,s then added a composition containing a CRSP binding partner or a CRSP 
protem respectively. Detection and quantification of complexes of CRSP proteins and 
CRSP b.ndmg partners provide a means for determining a compound's efficacy at 
mh.bU.ng (or potentiating) complex formation between CRSP and a binding partner 

da^T C<>mP0Und ^ ^ 3SSeSSed ^ generating d ° Se -Ponse curves from 
data obtamed usmg vanous concentrations of the test compound. Moreover, a control 

ZLT T rT™* t0 Pr ° Vide 3 baSdinC f ° r — In the «>ntrol assay 
Elated and purffied CRSP polypeptide or binding partner is added to a composition 
1 5 contammg the CRSP binding partner or CRSP polypeptide, and the ^0^ 
complex .s quantitated in the absence of the test compound. 

an H/ 7,16 r 11 *^ 38835,8 ° rthC PreSCnt inVenti ° n m • menabte 10 use of both soluble 
and/or membrane-bound forms of isolated proteins (e.g. CRSP proteins or biological y 

20 h h POrtl ° ns there °f <»• CRSP target molecules). In the case of cell-free assayj in 
20 wh.ch a membrane-bound form an isolated protein is used (e.g., a CRSP target molecu.e 
or receptor) it may be desirable to utilize a solubilizing agent such that the membrane- 
bound form of the isolated protein is maintained in solution. Examples of such 
solub.hz.ng agents include non-ionic detergents such as n-octylglucoside n- 
dodecylglucoside, n-dodecy.maltoside, octanoyl-N-methylglucamide, decanoyl-N- 

glycol ether) 3. [( 3-cholam.dopropyl)dimethylamminio]-l-pro P ane sulfonate (CHAPS) 
3-[(3-cholam.dopropyl)dimemyiamminio]-2-hydroxy-l-propane sulfonate (CHAPSO) 
or N-dodecyl=N,N-dimethyl-3-ammonio-l- P ropane sulfonate. 

In more than one embodiment of the above assay methods of the present 
.nvenuon, .t may be desirable to immobilize either CRSP or its target molecule to 
fachtate separation of complexed from uncomplexed forms of one or both of the 
protems, as well as to accommodate automation of the assay. Binding of a test 
compound to a CRSP protein, or interaction of a CRSP protein with a target molecule in 
35 sZLT ^ 3bSenCe ° f 3 CandidatC Ca " be —Pushed in any vessel 

test tubes, and rn.cro-centr.fuge tubes. In one embodiment, a fusion protein can be 
prov.ded wh.cn adds a domain that allows one or both of the proteins to be bound to a 
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molecule with a CRSP protein or biologically-active portion thereof, to form an assay 
mixture, contacting the assay mixture with a test compound, and determining the ability 
of the test compound to interact with, or modulate the activity of, the CRSP target 
molecule. In another embodiment, the invention includes a compound or agent 
5 obtainable by a method comprising contacting a CRSP protein or biologically active 
portion thereof with a test compound and determining the ability of the test compound to 
bind to, or modulate (e.g., stimulate or inhibit) the activity of, the CRSP protein or 
biologically active portion thereof. In yet another embodiment, the present invention 
included a compound or agent obtainable by a method comprising contacting a CRSP 
10 protein or biologically active portion thereof with a known compound which binds the 
CRSP protein to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with, or 
modulate the activity of the CRSP protein. 

Accordingly, it is within the scope of this invention to further use an agent 
15 ,dent,fied as described herein in an appropriate animal model. For example, an agent 
identified as described herein (e.g., a CRSP modulating agent, an antisense CRSP 
nucleic acid molecule, a CRSP-specific antibody, or a CRSP-binding partner) can be 
used m an animal model to determine the efficacy, toxicity, or side effects of treatment 
with such an agent. Alternatively, an agent identified as described herein can be used in 
20 an animal model to determine the mechanism of action of such an agent. 

The present inventon also pertains to uses of novel agents identified by the 
above-described screening assays for diagnoses, prognoses, and treatments as described 
herein. Accordingly, it is within the scope of the present invention to use such agents in 
the design, formulation, synthesis, manufacture, and/or production of a drug or 
!5 pharmaceutical composition for use in diagnosis, prognosis, or treatment, as described 
herein. For example, in one embodiment, the present invention includes a method of 
synthesizing or producing a drug or pharmaceutical composition by reference to the 
structure and/or properties of a compound obtainable by one of the above-described 
screening assays. For example, a drug or pharmaceutical composition can be 
0 synthesized based on the structure and/or properties of a compound obtained by a 

method in which a cell which expresses a CRSP target molecule is contacted with a test 
compound and the ability of the test compound to bind to, or modulate the activity of 
the CRSP target molecule is determined. In another exemplary embodiment, the present 
invention includes a method of synthesizing or producing a drug or pharmaceutical 
> composition based on the structure and/or properties of a compound obtainable by a 
method in which a CRSP protein or biologically active portion thereof is contacted with 
a test compound and the ability of the test compound to bind to, or modulate (e g 
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stimulate or inhibit) the activity of, the CRSP protein or biologically active portion 
thereof is determined. 
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B. Detection Assay s 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respect™ genes on a chromosome; and, thus, locate gene regions associated with 
genetic d,sease; (ii) identify an individual from a minute biological sample (tissue 
typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 

L Chrom osome Map ping 

Once the sequence (or a portion of the sequence) of a gene has been isolated this 
sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the CRSP 
nucleotide sequences, described herein, can be used to map the location of the CRSP 
genes on a chromosome. The mapping of the CRSP sequences to chromosomes is an 
important first step in correlating these sequences with genes associated with disease 

Briefly, CRSP genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the CRSP nucleotide sequences. Computer 
analysis of the CRSP sequences can be used to predict primers that do not span more 
than one exon in the genomic DNA, thus complicating the amplification process These 
primers can then be used for PCR screening of somatic cell hybrids containing 
individual human chromosomes. Only those hybrids containing the human gene 
corresponding to the CRSP sequences will yield an amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic cells from different 
mammals (e.g., human and mouse cells). As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in random order, but retain the 
mouse chromosomes. By using media in which mouse cells cannot grow, because they 
lack a particular enzyme, but human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme, will be retained. By using various media, panels 
of hybrid cell lines can be established. Each cell line in a panel contains either a single 
human chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. (D'Eustachio P. et al. (1983) Science 220:919-924). Somatic cell hybrids 
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2. Tissue Typi n ^ 

considering the use of restrict ft ' ^ """^ fc ' is 

■5 identification of Z ^ T„ *r I' POlym0 ' PhiSm <RFLP ' f " 
digested with one or IT " i " diVidUa, ' S S elw ™ DNA is 

of -do 8 ^T::::™::^ ~ T r - *• — - 

difficult The sequence, of * ' mak ' n6 posi,ive 'dentif.cation 

selected portions of at individual genome 2 s Z CR^T T^T""" * 
described herein can be used to pJL two PC R t m f "f 

urpreser ol such DNA sequences due to allelic di«i„„„ xi. 
3 present invention can be used tn„l» • t , "differences. The sequences of the 

-d from U S sue tTcrsp It, ,T tifiCa,i0n "*—■ fl ™ 

portions of the hJ^Tt, * ZT ^ ~ 
regions of the, sequenL Id .o^Z^Ta "",*^ «*■ 
estimated that allelic variation h*» ^ noncodtng regions. „ is 
about once per e ch C T tT°, """^ ™ B " ilh « of 

for idenuucadon Bel " """^ " * 

purposes. Because greater numbers of polymotphisms occur in the 
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noncodtng regions, fewer sequences are necessary to differentiate individuals The 
noncoding sequences of SEQ ID NO:l, SEQ ID NO:4, SEQ ID NO:7, or SEQ ID 
NO:10, can comfortably provide positive individual identification with a panel of 
perhaps 10 to 1,000 primers which each yield a noncoding amplified sequence of 100 
bases. If predicted coding sequences, such as those in SEQ ID NO:3 SEQ ID N06 
SEQ ID NO:9, or SEQ ID NO: 12 are used, a more appropriate number of primers for 
positive individual identification would be 500-2,000. 

If a panel of reagents from CRSP nucleotide sequences described herein is used 
to generate a unique identification database for an individual, those same reagents can 
later be used to identify tissue from that individual. Using the unique identification 
database, positive identification of the individual, living or dead, can be made from 
extremely small tissue samples. 
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3. Use of Partial CRSP S^,. P n. es in Forensir Rini» ry 
DNA-based identification techniques can also be used in forensic biology 
Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a cnme. To make such an identification, PCR technology can be used to amplify 
DNA sequences taken from very small biological samples such as tissues, e.g., hair or 
skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
sequence can then be compared to a standard, thereby allowing identification of the 
origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Sequences targeted to noncoding regions of SEQ ID NOs l SEO 
ID NO:4, SEQ ID NO: 7, or SEQ ID NO: 10 are particularly appropriate for this use as 
greater numbers of polymorphisms occur in the noncoding regions, making it easier to 
differentiate individuals using this technique. Examples of polynucleotide reagents 
mclude the CRSP nucleotide sequences or portions thereof, e.g., f ragme nts derived from 
the noncoding regions of SEQ ID NO:l, SEQ ID NO:4, SEQ ID NO? or SEQ ID 
NO:10, having a length of at least 20 bases, preferably at least 30 bases. 

The CRSP nucleotide sequences described herein can further be used to provide 
polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for 
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Zf * T hybridiZati ° n tCChniqUe ' t0 identif ^ a tissue, e g brain 

C * Predictive TvWiVj nf ,- 

1 » The present invention also pertains to the field of predictive medicine ,„ . 

dtagnosnc assays, pro gnostic ^ ^ m c J " <** 

prognostic (predion, purposes „ ^ _ prop ^J°' 

Accordingly one aspect of the present tavention relaKS „ d JJJ^" 

disorder, associated with aberrant CRSP «,„«.« • . . developing a 

proliferation dlflW * . expression or activity, such as aberrant cell 

proliferation, differentiation, and/or survival resulting for example in a 

Lrd^ 

ciatea with CRSP protein, nucleic acid expression or activity For 

Itr* 8 CRSP ^ 66 h ' « -y s 

„d- „ T Pr06 °° S " C OT Predi " iVe PU ^ '° ^ Pbophytatieallv ties, aT 

Another aspect of the invention pertains to monitoring the influence of a e ents 

~d °r s) on the e ~ ° r ° f crsp * si 

These and other agents are described in further detail in the following sections. 
30 Diagnostic Assay s 

An exemplary method for detecting the presence or absence of CRSP nrm*' 
"ucletc acid in . biologica , ^ b *^j£f^"" , t " 
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The nucleic acid probe can be f 

nucleic acid of SEO rn xr^ ' exam P ,e ' a foil-length CRSP m,oi ■ 

15 «. u dedt ° encom Pass directly..- ! h regard to to probe or 

*« a subject, as WeJJ . ^ - logical fluids ^ 

Section method of the invention c k * PW " rt Within * object Thlt k 

genomic DNA in u- ■ Canbeused »o detect CR^p D1 * ,s ' * e 

Southern hybridiV at ; ques for Section of CR<!P 

« one embodiment th» k- , . iec ™'ques. 

, m Samp,e isoI ^d by conventions A bio '°gical 

in another embodiment th, ° mean s from a subject 
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compound or agent capable of detecting CRSP protein, mRNA, or genomic DNA, such 
that the presence of CRSP protein, mRNA or genomic DNA is detected in the biological 
sample, and comparing the presence of CRSP protein, mRNA or genomic DNA in the 
control sample with the presence of CRSP protein, mRNA or genomic DNA in the test 
sample. 

The invention also encompasses kits for detecting the presence of CRSP in a 
biological sample. For example, the kit can comprise a labeled compound or agent 
capable of detecting CRSP protein or mRNA in a biological sample; means for 
determining the amount of CRSP in the sample; and means for comparing the amount of 
CRSP in the sample with a standard. The compound or agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to detect 
CRSP protein or nucleic acid. 
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2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
CRSP expression or activity. For example, the assays described herein, such as the 
preceding diagnostic assays or the following assays, can be utilized to identify a subject 
having or at risk of developing a disorder associated with CRSP protein, nucleic acid 
expression or activity such as a proliferative disorder, a differentiative or developmental 
disorder, a hematopoietic disorder as well as diseases, conditions or disorders 
characterized by abnormal cell survival, abnormal extracellular structure, or an 
abnormality in a defense mechanism. Alternatively, the prognostic assays can be 
utilized to identify a subject having or at risk for developing a differentiative or 
proliferative disease (e.g., cancer). Thus, the present invention provides a method for 
identifying a disease or disorder associated with aberrant CRSP expression or activity in 
which a test sample is obtained from a subject and CRSP protein or nucleic acid (e.g, 
mRNA, genomic DNA) is detected, wherein the presence of CRSP protein or nucleic 
acid is diagnostic for a subject having or at risk of developing a disease or disorder 
associated with aberrant CRSP expression or activity. As used herein, a "test sample- 
refers to a biological sample obtained from a subject of interest. For example, a test 
sample can be a biological fluid (e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) 
to treat a disease or disorder associated with aberrant CRSP expression or activity. For 
example, such methods can be used to determine whether a subject can be effectively 
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treated with an agent for a disorder, such as a proliferative disorder .hot ■ 

deve opmental disorder » h» m . • • border, a differentiative or 

defense mechanism. Alttmati t^ 
whe therasn ^ £££ TT , " Ve,ni0 " Pro "' tes '»^fo r de t en t ,i„ i „ g 

10 CRSP protein or mieleic „,iH . ■ detected (eg., wherein the abundance of 

adn.ini .ered ZZZZZZ? " a subject rha, cm, be 

« agent co treat a disorder associated with aberrant CP qd » 
activity.) <werrant CRSP expression or 

1 5 disorder characterized hv »K . 7 , gCne is at risk for « 

affecting the mtegri Jof .T^ T°" Cha " CK,i2ed b ? « •» of an alteration 
20 CRSP gle. ForSLple mch " S 8 CRSp -P«><ein, or the mis-expression of tile 

n an addition of olTmore ^ 'j™ "fT *» ' ««P gen. 

an alteration in the level of a messenaerRMAt <«igement 01 a CRSP gene; 5) 

25 modification of a CRSP JL 7 ™ ***** ° f * CRSP * ene > 6 > ab ™ 
7)thepresenc e :Z n ~ 

CRSP gene, 8) a non-wil J^e levero^a CRS^™ °f 2 m ^ SS ^| lger ^ A transcript of a 
and.O)inapp^^ 

s~pL^ 

probe/orir^ 11 deteCtion alteration involves the use of a 

probe/p„mer ma polymerase chain reaction (PGR) (see ee US Z ,m 
4,683,195 and 4 fisn , v ^iv M see, e.g., U.S. Patent Nos. 

— f«t detecting pL^'LI^S^^ " toP ^ 

c ^^^r-gene (see Abravaya e/ a/. (1 995) 
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Nucleic Acids Res . 23 -675-68?} n,;* *u . 

from the cells of the sa mni» . • ,eic acid (e.g., genomic, mRNA or both) 

presence « aosence a.pHfiJon ^ /l^r '.T" 5 ' *" 
amplification product and com™ ■ *u . «tecting the size of the 

*>, p CR jl LCR : a len8ih • a controi « - 

conjunction wid, my oTItZ ^ " * Pre " mi,,a ' in 
Alternative ^ i tad hLin. 

ampltfiction systen, (Kwoh, D.Y. «„/ ,o 8 , P ~ M ' a ' ranscri P«<»«' 

o*er nucleic acid amplification method 1,,, ! ' B f r « cl ~*W *H 97), „ r My 
15 tno,ecu,e s Mtag Z^f! ? * "'^ ° f 

schemes are especiaUyL Jfo *eT, " t Tbese 

» sample and comro! DNA is i s „IT TrT """^ Fot «"»■*. 

«**. endonucj^ Jf^e * ^ « « -« 

elecuophoresis and compamd of* 8 " de,em, ' ned ^ 
-.re, DNA ^ZIZZZ^^** *«« - 
~» rentes (see . for e«H ^^t^n T "* 

*9>. For examp,e, genctic ^ f 

arrays containing Ught-genamJ TrZ f * Kto "" le < 1 in "™ «inte„si„„a, 
Briefly, a fim, CJ^ZZTZT V"*" " ^ M T ' « « «•« 
°'DNAi„asamp,eandc„Z, 0 e^TlT 'j" - * — £l 
-** linear ataays „ aat, JL ^S^^*- ~ * 
of point mutations, tu, ^ js foIIowe f hv * ^ S s,e P the identification 



WO 98/46755 

PCT/US98/07894 

-70- 

^ptoemary ,„ a,, varianK or muMons ^ ^ 

of paraUe, probe sets, one eontptaentary ,„ the wUd-type gene and the ^ 

complementary to the mutant gene. 

5 ,h. „ T " m " her embodimM - "W °f' venety of sequencing reactions too™ in 
5 he an can he used to directly sequence rhe CRSP gene and detec mJ^T 
comparmg Ote sequence of the sanrpie CRSP with the corresponding wi L nmih 
equence Examp,es „ f sequencing reac|io „ s ^ ^ J^^*-* 
e doped by Maxim and G^ber, « 19 ,7) ^ 74:560) or ^ 
10 n^Lf ' s f° contemplated thar any of, variety of automated sequencing 

ESSIE- u,iliMd when performin8 * e diagMi < ^ «"«) 

1 9.448), includmg sequencing by mass spectrometry (see e s> ppt t ♦ • , 
PubHcation No . WO 94 /16 ,01 ; Cohen L ^llc^^T^ and 
Gnffine,*/. (1993)^/. 38:147-159). ' ' 

Other methods for detecting mutations in the CRSP gene include meth^c ■ 

such, tvhtch wii, exist due ,„ basepatr mismatches between the con JId ^pf 

hybnds treated wtth S, nueiease to enaymaticaHy digesting me mismatched "t l 
25 n^^'^ DN ^ NAM ^ NA ^«--beteeated^r 

seoZed bv sl? 7 m ' SmaKhCd ^i 0 " 5 ' « *en 
sepamted by s,* „„ denaturing polyacryiamide geis to determine the rite of mutation 
See, for example, Cotton at al. (1988) Proc. NcUcadSc, USA 85 4397. Z7T , 
0992) MahodsE^ymol 217-286 70S i„ r , 1 Sak,!ba <" 

30 RNAermbCabeledTde^io! preferred embodiment, the control DNA or 

In still another embodiment, the mismatch cleavage reaction ™„i„ 
™ Ptoteina tha. meogni. mismatched base prira in ££££ ^ 

mismaKh enzymes) in defined systems fo, deteotin, »„d m , 

-orations i„ CRSP cDNAs ohteined from samp.es -i^^TST 

15.1657-1662). Aceontag to m exemplary embodiment, a probe baaed o„ aCRSP 
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sequence, e.g., a wild-type CRSP sequence, is hybridized to a cDNA or other DNA 
product from a test cell(s). The duplex is treated with a DNA mismatch repair enzyme, 
and the cleavage products, if any, can be detected from electrophoresis protocols or the 
like. See, for example, U.S. Patent No. 5,459,039. 
5 In other embodiments, alterations in electrophoretic mobility will be used to 

identify mutations in CRSP genes. For example, single strand conformation 
polymorphism (SSCP) may be used to detect differences in electrophoretic mobility 
between mutant and wild type nucleic acids (Orita et al. (1989) Proc Natl Acad, Set 
USA: 86:2766, see also Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) 
1 0 Genet Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample and control 
CRSP nucleic acids will be denatured and allowed to renature. The secondary structure 
of single-stranded nucleic acids varies according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a single base change. The DNA 
fragments may be labeled or detected with labeled probes. The sensitivity of the assay 
1 5 may be enhanced by using RNA (rather than DNA), in which the secondary structure is 
more sensitive to a change in sequence. In a preferred embodiment, the subject method 
utilizes heteroduplex analysis to separate double stranded heteroduplex molecules on the 
basis of changes in electrophoretic mobility (Keen et al. (1991) Trends Genet 7:5). 

In yet another embodiment the movement of mutant or wild-type fragments in 
20 polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al, (1985) Nature 313:495). When 
DGGE is used as the method of analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of approximately 40 bp of 
high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 
25 used in place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
30 known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al (1986) Nature 
324:163); Saiki et al. (1989) Proc, Natl Acad Sci USA 86:6230). Such allele specific 
oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
mutations when the oligonucleotides are attached to the hybridizing membrane and 
35 hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on 
selective PCR amplification may be used in conjunction with the instant invention. 
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end of one primer when* ,mH 0 , . or at the extreme 3 

durable lo introduce a novel restriclio^i, u In ad *"™ « may be 

j- certain erabodiraenta a^, J f ™ ™« *<> " * -**~ 
araplrficadon (Barany (1991) ^ „„, ^ J^*™* 7* ^ ** 
'0 w,„ occur o„ ly if fere is a perfeo, match a , „ 3 „"tl e 5 ! 

possible to detect the oresen™ „p c sequence making it 

Presence or abse J ^ o. ""^ " 3 *"«• * * *r the 

Men, S exhib W n 8swtomsorfmi|yh J U - d : ;~ 

gene. 7 ui^ase or illness involving a CRSP 

Furthermore, any cell type or tissue in which prcd : 
in the prognostic assays described herein. ° qMBd may be Uti,ized 

ascription in a c J involv 1^^^^^ 
25 proliferation) can be applied not onW I Z A ^ u,atio " Cellular 

For example, the ^L^^2^ "* *° * "** 
herein to increase CRSP gene ex JLT * * SCreening as desCT *ed 

can be monitored in cllicT^ T Pr ° tem " CRSP 
reu in Clinical trails of subjects exhibiting decrease men 

expression, protein levels, „ r dovntrenulated CRSP , r 
30 effectiveness of an agent datennZT *' A,KraMi ^. 

expression, prorein ^tZ^SSTT " T" <** ~ 
Wis of subject exhibiting incraI2 CRSP *' °" " """ li, ° red 

upragulated CRSP activity Z h , t ° iPreSSi °°' """"" " Vds ' or 

preferably, otiC LtatLI" "? ' ™«y of CRSP 

35 diaordercbe^afr^ 00^7^^'°"^'*'- 
For exaraple, 1 „^ s " "T^ ° f "* <* • P^cular eel,. 

s nt te.g., compound, drug or small molecule) 
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which modulates CRSP activity (e.g., identified in a screening assay as described herein) 
can be identified. Thus, to study the effect of agents on proliferative disorders 
developmental or differentiative disorder, hematopoietic disorder as well disorders 
characterized by abnormal eel, differentiation and/or survival, an abnormal extracellular 
structure, or an abnormality in a defense mechanism, for example, in a clinica. trial cells 
can be isolated and RNA prepared and analyzed for the levels of expression of CRSP 
and other genes implicated in the proliferative disorder, developmental or differentiative 
disorder, hematopoietic disorder as well as disorders characterized by abnormal cell 
differenuauon and/or survival, an abnormal extraceUular structure, or an abnormatity in 
a defense mechamsm, respectively. The levels of gene expression (i e a gene 
express.on pattern) can be quantified by Northern blot analysis or RT-PCR, as described 
herem or alternatively by measuring the amount of protein produced, by one of the 
methods as described herein, or by measuring the levels of activity of CRSP or other 

15 Z S V n ^ ^ 8Cne eXPreSSi ° n ^ ^ SCrVe 33 3 marker ' ind -tive of the 

^ZiT^Z° f ^ nSt0ihea8ent AcCOrd ^y,thisresponsestatemaybe 
determmed before, and at various points during treatment of the individual with the 

In a preferred embodiment, the present invention provides a method for 
momtonng the effectiveness of treatment of a subject with an agent (e.g., an agonist 
antago„ lst peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate identified by the screening assays described herein) comprising the steps of ( ) 
obtaining a pre-administration sample from a subject prior to administration of the 
agent; („) etecting the level of expression of a CRSP protein, mRNA, or genomic DNA 

*lT a T m 7T SamP,C: (Ui) ° btaining ° ne ° r m ° re P^dministration samples 
from the subject; (tv) detecting the level of expression or activity of the CRSP protein 

mRNA, or genomic DNA in the post-administration samples; (v) comparing the level 'of 

expression or activity of the CRSP protein, mRNA, or genomic DNA in the pre- 

admmistration samp,e with the CRSP protein, mRNA, or genomic DNA in the post 

administration sample or samples; and (vi) altering the administration of the agent to the 

subject accordingly. For example, increased administration of the agent may be 

durable to increase the expression or activity of CRSP to higher levels than detected 

the 1 7 reaS t th I effeCdVeneSS ° f 3gent - A,temative ^ Creased administration' of 
he agent may be desirable to decrease expression or activity of CRSP to lower levels 
than detected, i.e. to decrease the effectiveness of the agent. According to such an 
embodiment, CRSP expression or activity may be used as an indicator of the 
effectiveness of an agent, even in the absence of an observable phenotypic response 
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CRSP protein activity associated with the cell a 

-vny can be an agent „ ^ ^ ^ 

naturaliy-occurring target mo ,ecu,e of a CRSP ? ^ «*' W * ^ a 
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acnv.ty of a CRSP protein or nucleic add m oI 7 * aben *» ex P-sion or 
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1 5 ^ ° r C ~n of agents tht ^ 
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of soeh a Nation is where j^^^ Ye, ano,h er exampl 

(eg, whe re a su6j e cl to aCl "7 e Nation ;„ , subjec , 

■|'"*«^to.2t ,,, * lh -*«ii 
-viva, in a sabjee^ whidl „ ?' donation, a^ 0 r 
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5 cancers and S ~ 8S ^ f °- - 

■ Mod. m a„ ~ IT CRSP ""~ <■* CRSP. 

conditions. The condition to be ttjrf 1 Premall8nam co " dili ^. and benign 
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The invention also provides mRth^c r 
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In yet another embodiment, the invention provides a mP thnHf _ • 
can be used to inhibi, mZZZTJ^ZZ '"^ *~ 

assoc ated with a cpntroi ^ • i_ . g aiseases or conditions 

include, but are not limited to- i ^ • • S ° rderS of the nervou s system 
with surgery T ^ ^ ,esio " S 

syndrome), poliomyelitis and h„~r* ™ of childhood (Fazio-Londe 

Tooth disease) ^ m0t0rsenso ^ — pathy (Charcot-Marie- 

be readministered to the subject. aitterentiate. The Ussue can then 

35 Among the approaches which may be used to ameliorate A- 
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overexpressing a receptor for CRSP. expressmg or 

3- Pharmacn genomirg 

developmental disorders) associated with aberrant CRSP ^ ' 8 ' ' ^ " 

such treatment, pharmacogenomicsrie 2 ,7 V f * faow *«*»'*h 

» individual- -P^-S^^^r^^- 

be considered. Differences in metaboliJ nfT 8 " C ° mp0Und ° r dru S> ™* 

phaimacologicaliy active dn,„ Th„. ,. . Sea " dbloodc< " 1 «"<ralionoflhe 

knowiedge i. iTta ^ , p^l^' 8 " " C,iniCian ^ C °" Sider 
» adnrintaer . CRSP JSnST T"~ * «— *• „ 

pnarn.acogen.tic cocoas can be difFerentialed Genei' T ' W ~ ° f 
conditions iransmitied as sln„l. f . 7 tog acllon > or »»>eiic 

dehydrogenase decieZoZr ^ ** eXmP ' e - *«~«1*-pta. 

main citiuca! conrlZ " "T"" inherited e ° Zyn, °'' a,h >' ™ » hi <* <* 

smfonanridos, aT X „ L^I T' S d inSeS " 0 ° ** 
. analgesics, nnrofiirans) and consumption of fava beans 

-pof^hZg^z^rrra^^'""'^ 1 *" 

allelic- gene marker mZwH T y 0W " ^'a* markers (e.g., a "m- 

marker map wh,ch con S1 s, s of 60,000. ,00,000 polymeric „ ^ 
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"gnifican. number of paLts I " " 0f a SBlisli «* 

single nucleohde pol y mo,ph isms (S lT" th T ° f "°" ^ 

"SNP- is a common a,,e ra l n Ta I 8en0me ' M °*°> h ^'n, a 

DNA. Fo, exampie, a S NP l^oT , ^ ^ 3 

be grouped ,„ W 6enelic ca,e 6 „„ es ^ ° ^ rf ,u * SNPs - caa 

individual ge„„ me . ,„ such rZ^^T ' Pa " iC '" ar """^ of SNPs * •* 

'nchgepeucallyrinritaHndi^^ 

Alternatively, a method termed the v»n^ -a . 
Mean* 8 = ms lhal ™1 A r * ^ aPPI ° aCh "' " <* - 

encodes a druga is kao™^ a C R S 7 8 '° "* ^ •« 

Popoladon aad i. eaa be detamtoed if hi 7 " ™* y "> *• 

» aasoeia.e, . p^ ™ ^ «* - of 4= geae ve K aa anod.ee ia 

gc-jepoW^of , a „ g d ~?N r^^^ 

2, aad cyoahrome P450 CYP2D6 Ll^c o> " 2 ^ 

5 « to why „„, pau^ do J" c ™ C 1 9 > ^ P'°"ded aa explananon 

*• response aad serious Wd^LtZT ^ *" —— " 

Tlreae poW^ „ """J SKn<tai ^ «* *>se of a drug. 

nifferautpop^atio^. For example Z ^ ° fPM iSd,flere " *n°n 8 

-* -era, masons CT^ "** " ■»*»»■*■« 

experieaee exaggerated drug respoTOe !! Z™? CYP2C1 ° 1* ftequenriv 
•f ' n,e Bb o, ite „ lhe acti ve Im"^ Si*" f — " *~ 
demons.ra.ed for the amdgesic effec. ofTT "° """P""* ™P°nse, aa 

-* monahine. * " s 

* « -pond .„ sandard T"? "" ra - raPid ~ Wi ~ »»° 

has been idcn rif le<i , 0 K'^ST" ^ 

10 LYP2 D6 gene amplification. 



10 
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idenff Alten,at ; Vely ' 3 meth ° d temied thC CX P ression P-fi'ing", can be utilized to 

t p T ct dmg response - For exampIe ' ^ ■« e ~ or" 

an m l d d ^ a drug a CRSp ^ cRsp 

Information generated from more than one of the above pharmacogenomics 
approaches can be used to determine appropriate dosage and trelent re, Z for 
prophylactic or therapeutic treatment an individual. This knowledge, o 
dosing or drug selection, can avoid adverse reactions or therapeutic aihire ZTus 

This invention is further illustrated by the following examples which should not 
be onstrued as limiting. The contents of all references, patents and publishe d patlm 
apphcanons cited throughout this application are hereby incorporated by ^ ^ 



EXAMPLES 



25 



30 
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E2^n B lel: Nation And rh.rnrtoriT ntinn nfHuman rilKP., gDNj 

CRSP uT ^ iS ° ,ati0n ^ CharaCteri2ation of ^e gene encoding human 

CRSP-1 (also referred to as "CRISPY-1 - or "TANGO 59") is described. 

Isolation of the human rp sp.i r r>MA 

The invention is based at least in part on the discovery of a human gene encoding 
a secreted protem, referred to herein as Cysteine Rich Secreted Protein- 1 (CRSP a 

tTs J ; * was ; s : ,ated using a signai sequence ^ ™. ~ y 

takes advantage of the fact that molecules such as CRSP have an amino terminal sill 

sequence which directs certain secreted and membrane-bound 

cellular secretory apparatus. 8 

Briefly, a randomly primed cDNA library using mRNA prepared from human 
fetal brain tissue (Clontech Pain AUn j , numan 

cDMA 5 ^ • r „ 6Ch ' Pa, ° Alto CA > was m ade by using the Stratagene-ZAP- 
cDNA Synthes ls r« klt , (catalog #20041). The cDNA was ligated into me marnmalian 
express vector pTrap adjacent to a cDNA encoding placenta, al ^ZZ^ 
adcmg a secretory signal. The plasmids were transformed into E. coli and DNA was 

cosT r g w r d ™ DNA purif,cation kit (iwga >- ™* - 

cuj>-7 cells with lipofectamine™ (Gihm rrt \ a * ^» L . 

(<Jibco-BRL). After 48 hours ncubation the COS rHi 
sup^^ were ^ foralkaline on a ^J^^ 1 
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counter using the Phospha-Light™ kit nw , 

P'asmid DNAs scoring positive „ ZSZ^^fT* ^ 
were f urther ^ fey ^^£^»>+^ secretion assay 
Using a partiaJ cDNA isolated I b v fh k ' Pr ° CedUres - 

^ cDNA encoding hu m an «** 3 ™ ^ 

length h CRSp , protein «~e seance encoding^ « 

The full length protein encoded by this nuclei " *** 38 SEQ - ID NO: 1 . 
«* a,d has the amino acid sequLtswl p " 350 «*"> 

The c d, porti , n (open ^ J f 1 and set fortn as SEQ ID NO:2 . 

AILtas Accession No. 98634 

^•feoj^w^ 

quence set forth in SEQ ID NO:2 indicated ih* KiP 1 tav '"S Ihe amino 

f maboutammoacid , to about amino acm CseoTmT' ' -+o 
ammo «*> ^"ce SEQ ,D NO-2 uato. if , * FUrth,:r ""yaia »f 

J"*-* , he preseiKe „ f a ^ > *MI Peptide prediction programs 

'.2i,ot23ofSEQ IDNO :2 a2££^ """^ ^ ' * *" «*» acid 

20 r"" 3dd 3 *> °«EQ ID NO:2. The pre^l 7'"° * % " 2 " '° 

*« *. CRSP-, has „„„ idemMed *• W sequence, in addition to the 

Examination of the cDM a co 
CRSP-, ^icniat^c,,^^;^^^^ , 

' M "^20cys K mere S id U e s , o Led !cr °™" " F ' 8Ure '• C *SP-I 

■QIDNftz Theaecstein^^ZT^f ' 

^LASTsenmh^.^^^^'Omsmfldebridgea. 
r^^^^ce.ofC^/^^^^'^^^ofm.nnc,^ 

'° a * Cke " ">NA encoding ap o^n „ f t CRSP -' ' S S '' S "*""^ 
Accession No. wo,,. ^ C DNA wa t a ^dT" 0W " fU " C " 0n °'" B ^ 
*o«n to he expressed in lem fibTa^ 3 ,MS cDNA and 

; - CRSP -' the chicken 
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~, y high in fc cystei „ e * • **- Protein and human CRSP-1 ia 
acd s ,47 and 2*4 of SEQ ID NO- 2 .„ nl " , ,. " l0CaKd bMw «" «*» 

fc „££^™»* <~X,, were ^ in . 

promotes ,nm„rig e „esis. A sche 'f """""I • *Mon in chromosome 10 that 

*e stances of me CRS^^oT ** 
region of identity ba,wcan CgSKm^TJT™ " ^ * ^ 
» W* of the huma „ CRSp ., J^- ^"»*» W comprises . region of fte 3. 

differentia, screen a„ d sub * *» described 

messages as depicted. R> 0 -,L 7 Ha Z.M T W0 ^-'^ M 

Length ?aalm2; md ^ Cap Pe„ a , ly: 4 , Gap 

20 K'Me: 3; Gap j. w * ' ™* f ^f" L 'P™» DNA alignmen, 

Pro,ei„ lacks me N-«nninal ail, L ' RSP "' """""e* "* ™»ded 

— - Promim The ■ " - » * * 

«*» of me initial cDNA. ££££' < £? " «SP-, in me 

suggest mat CRSP-1 may te imponaill CRSP -' ™'° »™an glioblastoma and 

* A tole i„ g, ioblas|on , a „' ™ ° f "» Phenotype. 

observed in hnman ta t d f T' ^ 

1. RIG and RJG-lilce genes ,0 a rea " ! f i, *' ^'^^on of the CRSP- 

^velopmen, of homa! J£EZ£T~ " '"^ ^"^ " 

— -m^^^^-^neesimi^io 

under snrngen, c„ ndilions . „ o.2xSSC at 65°c. 
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Hybridizanon of a Cl„„, n , " "* descnbed >*l°w. 

5 heated to preseiw ^cr " , ' llm8 ' - «*W 

'owe, leve,s ofCRSP-1 m ^A ^ ^ * ^ brai "' and s, ig h„ v 
' -RNA was bmd in fela , ...J" '«* ^ H »»-e, „o s i 8niftamt |eveJ of ^ 

f-^^!^^^**^ *«— (MTN, b ,„, 
10 **« — *L kidney, and pancre^t™ ^""^ *« 
P-seoce of high leveb of CRSP.,7™' K RS,M **— "* 

brain, and 

musCe. However, „„ Mp^^^lT** «" *» «— h ado,, 
«** l,ver, kidney, or pancreas , ' mRNA "» ob s=rved in 

« «SP, waa not ea pre Ld a, ^e ^ T^"" * * h °™°^ » 
Dw- Biol. 40:53 1). We ' eVels m (Sawada et al. (,996) , M j 

-ions were hyhHdized „„h CRSP^^ ™ pT^"" *» 

f^T ^ (re "" a ^ '-* ~ oC »S I" ~ ~ in hear,, 

' mRNA was o-Proascd in heart, lun . eye ' ° ' '" 4 5 embryos. CRSP- 

«% (foot, ,„chea, hnynx, brai ° C«MX and 

■"■"—A <**• (aoia only), eye (renl l d 7 ^ 

~" *— * brain, heart, and pt, ^ — * "» *• —« 



Constructs 



Expression constructs for two fonns of CRSP , 
mamma,ian expression vector pMET stoTp , ^ Usin S *° 

*e co mp ,ete 350aa CRSP-, J^* T ^ C ° m ^ d a cDNA incorporating 
35 -prised tne entire CRSP- 

-ids (CRSP-l flag , hort)< A c. "^ ding SC<JUence e -ept for the final 18 amino 

(DYKDDDDK) was added to b£S£~ * epitope 

thCRSP-lfonnsfore^^^^ 
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CRSP-lflag cDNAs were generated by PGR from a full length CRSP-1 cDNA template 
and hgated into pMET-stop using EcoRl and Sail restriction sites. 

Trial transfection - small scale expression 

5 Expression constructs for CRSP-1 flag.long and CRSP-1 flag.short were 

transfected into 293T cells using 10 pi of lipofectamine (GIBCO/BRL) and 2 ug of 
DNA per well of a 6-well plate of cells which were 70-80% confluent. After 5 hours at 
37 C. cells were fed with 1ml of 20«/oFCS/DMEM. After incubation overnight at 37°C 
cells were conditioned in 1ml OptiMEM for 48 hours at 37°C. Samples of supernatant ' 

^Vpirf 7^ SOlUbUiZed " b0iHn8 SDS " PAGE gd buffer > ™ «* « a 4-20-/0 
SDS-PAGE gel, transferred to a nylon membrane and probed with the anti-FLAG 

monoclonal antibody M2. Samples from both supernatant and pellet samples showed 
significant immunoreactivity within a molecular weight range of 40-65 kDa on 
autographic film using a HRP conjugated secondary antibody and ECL detection 
15 reagents. Thus, both forms of CRSP-1 tested are secreted from 293T cells thereby 

confirming experimentally that CRSP-1 is a secreted protein. It should be noted that the 
molecular weights of both forms of CRSP-, tested are greater than predicted from the 
ammo acd sequence, suggesting that the CRSP-1 proteins secreted by 293T cells may 
be glycosylated. This is consistent with the presence of four potential sites for N-.inked 
20 glycosylate m the CRSP-1 protein. 

Large scale CRSP-1 protein production 

For scale-up of CRSP-1 flag.long protein expression, 30 x 150mM plates of 293T 

25 Him ! ZtT^ WCre tnmSfeCtCd With 27 M DNA ' 100 * "Pofectamine in 
18 ml OptiMEMfor5 hours at 37°C. 18 ml of 10%FCS/DMEM was added to each plate 
and incubated overnight at 37°C. 24 hours after the start of transfection, transfection 
supernatant was aspirated and 35 mis OptiMEM was added to each plate and the plates 
mcubated at 37T for 72 hours. Conditioned medium was harvested, spun at 4000 rpm 
for 30 mi, at 4«C, and filtered through a 0.45 micron filter unit. „ 00 ml was passed 
30 over a 1.6 x 10 cm anti-FLAG M2 affinity column pre-equilibrated in PBS pH7 4 buffer 
at a flow rate of 2.0 ml per minute. After washing with 200 ml of PBS pH 7 4 buffer 
bound material was eluted by a step of 200 mM Glycine pH 3.0 buffer and 0 5 ml ' 
fractions collected. Upon e.ution, a significant protein peak was detected by absorbance 
at 280nm. Samples corresponding to conditioned medium, flow through and eluted 
15 fractions were analyzed by Coomassie blue and silver stained SDS-PAGE and by 
western blot analysis as described above. Significant immunoreactivity within a 
molecular weight range of 40-65 kDa was detected in conditioned medium and eluted 
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the predominant immunoreactive nm,~ 5>DS-PAGE gels suggested that 

s u _ j J , luun °reactive protein constituted >90% of thcnr^' 

To identify novel proteins related to CRSP 1 th. i. 
-uence watt used ,„ a.^ fc prjvaB ™ * « amino acid 

(W^hUversion, 2.0 BLOSUM62 , \ USmg ^BLASTN 

» protein „ * 7 ^ ~* ta **« 
CRSP-I. These mfm J l """""^ *° hl "™ 

nCRSP- 2> h . CRS p q 3 , „.c Rs ;: S cRsP-Z * * — hCRSP * 



20 

2 



sequence inflation corobinedwith adit " """^ furth " «■ «* 

* --"■Wypro 8 ^P W 7^eTuw^rrr CKfr0mdbESTUSto ^ e 
sentence depicted in Figaro 3 pi f » *"* " M «> te "CRSP-3 

ATCC as Accession J»3 Z^T^ 75 ^ *»— -* *• 
^c f „r„CRSP.3 rcve J^ 

Placenta. CRSP-3 mRNA exor^ t. eXPrcSSi<>n <° 

adult hutnttn tisauca ' " " *— — '» -to nomta, 

molccule (Nature, m , 357.368). d J,td^ rib T "** ' " *"■"■« 

forrnation during early XmmJJ!! " ° ° f ""d 

Given the homology between CRSP-3 and dkk l it 



30 

dkk 

35 
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Proteins has been w^nl " ■«**»■ Ove,exp ression of „„, 

5 digan and Nusse, Genes and Dev /wi,^,* * to ™«* (see 

****** other CRSP familv m 'e mber T , 3305) - ""^w. a. leasl CRSP-3 
« signaling in CMCer ,y m ™^. ™y a,so p,a y a r„ le in lhe suppression * 

accession number WSsllT^T^ fr ° m a sin 8 ,e dbEST clone with 

Northern Mo, an„ ysis of vajous ^ *Pta* in Figure 4. 

«- CRSP-4 mRNA was .^T^;?* ^ *» -ea,ed 
"predion . hjghes , jn -* human Ussue , CRSp-| 

h-CRSP-li ke . n panial „. J" " " ' aCC " H - '""S- «" ^e>al muscle. 
' * with accession number AA3 JCTT "" ^ *" ' *T done 
'MACE coupon and ^J^^T *— *» 

depwed in Fig„ rc j. ™' y '° define "» hCRSP-li|ce.„ seqittnce 

human CR S p'TTT t ° flheMin ° Mid! ^ ue ° CK »fta'nan CRSP 1 h „ 

man CKiP-3, Md humm c man (.KiP-1, human CRSp _ 2 

« conserve, bertveen hlman CRSp ^T^- * A«ino ooid residues wW ch 
cys.e,„e residues u* i„ dicaled „ y „ J££ ^ « <»»e d . The 2 „ conserved 

crd., Md CRD . 2 The SST""^ don,ains - **~> - 

25 t PreSOT are depiced in F%ure 7 °1* ''"P" CRSP P™** of the 



WO 98/46755 



PCT/US98/07894 



-87 



^TgCC h T IOgieS W6re determinCd USing thC ALIGN ^ ram ' (version 

2.0) GCG software package) using a PAM120 weight residue table, a gap length 

penalty of 12, and a gap penalty of 4. 8 



> Table 2: 



CRSP-1 


CRSP-1 

100 


CRSP-2 

30.0 


CRSP-3 


CRSP-4 


mdkk-1 


xdkk-1 


1 CLFEST 


CRSP-2 




100 


37.2 
43.0 


34.7 
35.9 


31.5 
38.8 


45.4 
38.4 


58.8 
36.7 


CRSP-3 






100 


59.3 


66.4 


53.7 


32.1 


CRSP-4 








100 


38.8 


38.4 


36.7 



Nucleotide percent homologies were determined using the using the Wilbur Lipman 
^gnment program, Ktuple: 3; Gap Penalty: 3; Window: 20. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routme expenmentation, many equivalents to the specific embodiments of the invention 
descnbed herem. Such equivalents are intended to be encompassed by the following 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: MILLENNIUM B IOTHERAPEUTI CS INC 

(B) STREET: 620 MEMORIAL DRIVE 

(C) CITY: CAMBRIDGE 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: US 

(F) POSTAL CODE: 02319 

(G) TELEPHONE: 

(H) TELEFAX: 

(ii) TITLE OF INVENTION: NOVEL CRSP PROTEIN AND NUCLEIC ACID 
MOLECULES AND USES THEREFOR ACID 

(iii) NUMBER OF SEQUENCES : 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD, LLP 

(B) STREET: 28 STATE STREET 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: US 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE : Patentln Release #i.o, Version #1.25 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US98 / 

(B) FILING DATE: 16 APRIL 1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER : US 08/843 704 

(B) FILING DATE: 16 APRIL 1997 

(C) APPLICATION NUMBER: US 08/842 898 

(D) FILING DATE: 17 APRIL 1997 ' 

(E) APPLICATION NUMBER: 60/071 589 

(F) FILING DATE: 15 JANUARY 1998 

(G) APPLICATION NUMBER: 09/009,802 

(H) FILING DATE: 20 JANUARY 1998 

(Viii) ATTORNEY/AGENT INFORMATION : 

(A) NAME: MANDRAGOURAS , AMY E 

(B) REGISTRATION NUMBER: 36,207 

(C) REFERENCE/DOCKET NUMBER: MEI-008CPPC 
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15 (ii> "OLBCULE TYPE : cDNA 

FEATURE: 

J; *W»/«y, CDS 
(B> LOC ATION: 38.. 1087 

t«) SEQUENCE D ESC RIPTION: ggg ffl 
GGCACGAGGG GGCGGCGGCT GCGGGCGCAG AGCGGAG ATG CAG 

103 



35 



40 



50 



_ 3 *-"=u lily 

-= B ====SBsasss.-- 

> CCC rr-n 15 3 Pr ° Ala 

<-CC GCT CCG ACf 20 

30 G1> " Pro Ala Leu 

■ 5 5 2 =S225J- 

95 Thr Thr Asp Thr 

GTT GGA AAT AAT »~ 100 
*» V.1 Gly ^ £ ££ JTC CAT GTG CAC CGA GAA ATT Cfl 

e So val His *■ £ £ 2 s 

115 S 



151 



199 



247 



343 



391 
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ACC AAC AAC CAG ACT GGA CAA ATG GTC TTT TCA GAG ACA GTT ATC ACA 
Thr Asn Asn Gin Thr Gly Gin Met Val Phe Ser Glu Thr Val lie Thr 



5 



15 



20 



35 



40 



45 



50 



439 



487 



583 



631 



679 



727 



120 125 130 

TCT GTG GGA GAC GAA GAA GGC AGA AGG AGC CAC GAG TGC ATC ATC GAC 
Ser Val Gly Asp Glu Glu Gly Arg Arg Ser His Glu Cys He lie 2p 
" 140 145 150 

10 T° CCC AGC ATG TAC TGC CAG TTT GC = AGC TTC CAG TAC 535 

10 Glu Asp cys Gly Pro Ser Met Tyr Cys Gin Phe Ala Ser Phe Gin Tyr 
155 160 1S5 

ACC TGC CAG CCA TGC CGG GGC CAG AGG ATG CTC TGC ACC CGG GAC AGT 
Thr Cys Gin Pro Cys Arg Gly Gin Arg Met Leu Cys Thr Arg Asp Ser 
170 175 i8o 

GAG TGC TGT GGA GAC CAG CTG TGT GTC TGG GGT CAC TGC ACC AAA ATG 
Glu Cys cys Gly Asp Gin Leu Cys Val Trp Gly His Cys Thr Lys M^ 
85 190 195 

GCC ACC AGG GGC AGC AAT GGG ACC ATC TGT GAC AAC CAG AGG GAC TGC 
Ala Thr Arg Gly Ser Asn Gly Thr He Cys Asp Asn Gin Arg Asp Cys 

25 CAG CCG GGG CTG TGC TGT GCC TTC CAG AGA GGC CTG CTG TTC CCT GTG 
Gin Pro Gly Leu Cys Cys Ala Phe Gin Arg Gly Leu Leu Phe Pro Val 
215 220 225 2 3 0 

30 T r? C tu A CCC GTG GAG GGC GAG CTT TGC GAC CCC GCC AGC 775 

30 cys Thr Pro Leu Pro Val Glu Gly Glu Leu Cys His Asp Pro Ala Ser 

235 240 245 

CGG CTT CTG GAC CTC ATC ACC TGG GAG CTA GAG CCT GAT GGA GCC TTG 
Arg Leu Leu Asp Leu He Thr Trp Glu Leu Glu Pro Asp Gly Ala Leu 
250 255 260 

GAC CGA TGC CCT TGT GCC AGT GGC CTC CTC TGC CAG CCC CAC AGC CAC 
Asp Arg Cys Pro Cys Ala Ser Gly Leu Leu Cys Gin Pro His Ser His 
265 270 275 

AGC CTG GTG TAT GTG TGC AAG CCG ACC TTC GTG GGG AGC CGT GAC CAA 
Ser Leu Val Tyr Val Cys Lys Pro Thr Phe Val Gly Ser Arg Asp Gin 
280 285 290 

GAT GGG GAG ATC CTG CTG CCC AGA GAG GTC CCC GAT GAG TAT GAA GTT 
Asp Gly Glu lie Leu Leu Pro Arg Glu Val Pro Asp Glu Tyr SI Val 
295 300 305 * 310 

GGC AGC TTC ATG GAG GAG GTG CGC CAG GAG CTG GAG GAC CTG GAG AGG 1015 
Gly Ser Phe Met Glu Glu Val Arg Gin Glu Leu Glu Asp Leu Glu Arg 
315 320 325 

AGC CTG ACT GAA GAG ATG GCG CTG AGG GAG CCT GCG GCT GCC GCC GCT 1063 
Ser Leu Thr Glu Glu Met Ala Leu Arg Glu Pro Ala Ala Ala lit aS 
330 335 340 



823 



871 



919 



967 
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S S 2 5 2 Si £ « *™» CCAGGCTGTG GGTAGATCTG 1117 



5 



345 



350 



CAATAGAAAT AGCTAATTTA TTTCCCCANG TGTGTGCTTT AAGCGTGGGC TGACCAGGCT 1177 
TCTTCCTACA TCTTCTTCCC AGTAAGTTTC CCCTCTGGCT TGACAGCATG AGGTGTTGTG 1 237 
10 CATTTGTTCA GCTCCCCCAG GC^rTCTCC AGGCTTCACA GTCTGGTGCT TGGGAGAGTC 1297 
AGGCAGGGTT AAACTGCAGG AGCAGTTTGC CACCCCTGTC CAGATTATTG GCTGCTTTGC 1357 
15 CTCTACCAGT ^OCAGACAG CCGTTTGTTC TACATGGCTT TGATAATTGT TTGAGGGGAG 1417 
GAGATGGAAA CAATGTGGAG TCTCCCTCTG ATTGGTTTTG GGGAAATGTG GAGAAGAGTG 1477 
CCCTGCTTTG CAAACATCAA CCTGGCAAAA ATGCAACAAA TGAATTTTCC ACGCAGTTCT 1537 
20 TTCCATGGGC ATAGGTAAGC TGTGCCTTCA GCTGTTGCAG ATGAAATGTT CTGTTCACCC 15 97 
TGCATTACAT GTGTTTATTC ATCCAGCAGT GTTGCTCAGC TCCTACCTCT GTGCCAGGGC 1657 
^ AGCATTTTCA TATCCAAGAT CAATTCCCTC TCTCAGCACA GCCTGGGGAG GGGGTCATTG 1717 
TTCTCCTCGT CGATCAGGGA TTTCAGAGGC TCAGAGACTG CAAGCTGCTT GCCCAAGTCA 1777 
CACAGCTAGT GAAGACCAGA GCAGTTTCAT CTGGTTGTGA CTCTAAGCTC AGTGCTCTCT 1837 
30 CCACTACCCC ACACCAGCCT TGGTGCCACC AAAAGTGCTC CCCAAAAGGA AGGAGAATGG 1897 
GATTTTTCTT TTGAGGCATG CACATCTGGA ATTAAGGTCA AACTAATTCT CACATCCCTC 1957 
^ TAAAAGTAAA CTACTGTTAG GAACAGCAGT GTTCTCACAG TGTGGGGCAG CCGTCCTTCT 20 1 7 
AATGAAGACA ATGATATTGA CACTGTCCCT CTTTGGCAGT TGCATTAGTA ACTTTGAAAG 2077 
GTATATGACT GAGCGTAGCA TACAGGTTAA CCTGCAGAAA CAGTACTTAG GTAATTGTAG 2137 
40 GGCGAGGATT ATAAATGAAA TTTGCAAAAT CACTTAGCAG CAACTGAAGA CAATTATCAA 2197 
CCACGTGGAG AAAATCAAAC CGAGCAGGGC TGTGTGAAAC ATGGTTGTAA TATGCGACTG 22S7 
^ CGAACACTGA ACTCTACGCC ACTCCACAAA TGATGTTTTC AGGTGTCATG GACTGTTGCC 2317 
ACCATGTATT CATCCAGAGT TCTTAAAGTT TAAAGTTGCA CATGATTGTA TAAGCATGCT 2377 
TTCTTTGAGT TTTAAATTAT GTATAAACAT AAGTTGCATT TAGAAATCAA GCATAAATCA 2437 
50 CTTCAACTGC TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 



2479 



WO 98/46755 

-92- 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



PCT/US98/07894 



2: 



Met Gin Arg Leu Gly Ma Thr Leu Leu cys ftia ^ a ^ 

10 15 
15 Val Pro Thr «, Pro Ala Pro Ma pro Thr Ma ^ Ma ^ v ^ 



20 25 

^ 5 30 



I*. Pro cay Pro Ala Leu Ser Tyr PrQ ^ ^ ^ ^ ^ ^ ftsn 
20 40 45 



Glu Met Phe Arg Olu val Glu Glu Leu Met Glu Asp Thr Gln His Lyg 

b5 60 

25 Leu Ar g ser A l a Val Glu Glu Met Glu A la Glu Glu Ala Ma Ala Lys 

75 

Ala Ser Ser Glu Val Asn Leu Ala Asn Leu Pro Pro Ser T yr His Asn 
85 90 95 

30 Glu Thr Asn Thr AS p Thr Asn val Gly Asn Asn ^ 



105 110 



35 120 125 



Ser Glu Thr Val xie Thr Ser Val Gl y Asp Glu Glu Gly Ar 3 Ar g Ser 

135 140 

40 145 ^ " e " e S G1U ^ <*» ay Ser Met Tyr Cys Gin 

155 160 

Phe Ala ser Phe Gin Tyr Thr c ys Gin Pro C^s Ar g Gly Gin Ar g Met 

170 1?5 
45 Leu C.3 Thr J, Asp ser Qlu cys ^ ^ ^ v ^ ^ p 



185 190 
Gly His ays Th r Lys „ et Ala Thr Ar g Gly Ser Asn Gly Thr „. Cys 
50 200 205 

ASP Asn Gin Ar g Asp Cys Gin Pro Gl y Leu CyS Cys Ala Phe Gin Ar g 

220 

^ Gly Leu Leu Phe Pro Val Cys Thr Pro Leu Pro Val Glu Gly Glu Leu 

235 240 



25 



30 



WO 98/46755 
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-93- 

Cys His Asp Pro A i a „„„ R 

Ala Ser Arg Leu Leu Asp Leu He Th. * 

245 ljeu ri e Thr Trp Glu Leu 

5 Glu Pro Asp Gly Ala Lpi ; » 255 

«" " P " 9 a <*• «• ~ «y u. ,« u 

,0 " Gln •» " is ~ ~ s " JS ~ - « - - - - ». 

« S - „ oly „„ ne L . u ^ ™ ^ 

300 

15 - vp «. ^ 01u s Qly s „ phe ^ ^ mu w ^ Ma ^ 

20 Pro Ala Ala Ala Ala Ala Ala t , ^ 
340 Ala Ala Leu Leu Gly Arg Glu Glu H e 

345 

(2) INFORMATION FOR SEQ ID NO:3: 

<i) SEQUENCE CHARACTERISTICS • 

A) LENGTH: loso base pairs 
B TYPE: nucleic acid 
C STRANDEDNESS: single 

topology: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE : 
35 (A > NAME/ KEY : CDS 

(B) LOCATION: 1.. 10 5 0 



<*±> SEQUENCE DESCRIPTION: SEQ I D N0 . 3 . 

- = = 5 = 5 = = = = - - - - - - s 

- S S S 2 SI giu % 2 « - « ~ «. « » 

55 Sp Thr G ^n His Lys 

fin 



45 



50 



48 



96 



144 



192 



WO 98/46755 
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-94- 

TTG CGC AGC GCG GTG GAA GAG ATG GAG GCA GAA GAA GCT GCT GCT AAA 24 0 

Leu Arg Ser Ala Val Glu Glu Met Glu Ala Glu Glu Ala Ala Ala Lvs 
65 7 ° 75 80 

5 GCA TCA TCA GAA GTG AAC CTG GCA AAC TTA CCT CCC AGC TAT CAC AAT 288 
Ala Ser Ser Glu Val Asn Leu Ala Asn Leu Pro Pro Ser Tyr His Asn 
85 90 95 

GAG ACC AAC ACA GAC ACG AAC GTT GGA AAT AAT ACC ATC CAT GTG CAC 336 
Glu Thr Asn Thr AS P T hr Asn Val Gly Asn Asn Thr He His Val His 
100 105 110 



CGA GAA ATT CAC AAG ATA ACC AAC AAC CAG ACT GGA CAA ATG GTC TTT 
Arg Glu He His Lys He Thr Asn Asn Gin Thr Gly Gin Met Val Phe 
' 115 120 125 

TCA GAG ACA GTT ATC ACA TCT GTG GGA GAC GAA GAA GGC AGA AGG AGC 
Ser Glu Thr Val He Thr Ser Val Gly Asp Glu Glu Gly Arg Arg Ser 
130 135 140 

CAC GAG TGC ATC ATC GAC GAG GAC TGT GGG CCC AGC ATG TAC TGC CAG 
His Glu Cys He He Asp Glu Asp Cys Gly Pro Ser Met Tyr Cys Gin 
145 150 155 i 60 

TTT GCC AGC TTC CAG TAC ACC TGC CAG CCA TGC CGG GGC CAG AGG ATG 
Phe Ala Ser Phe Gin Tyr Thr Cys Gin Pro Cys Arg Gly Gin Arg Met 
16S 170 175 

CTC TGC ACC CGG GAC AGT GAG TGC TGT GGA GAC CAG CTG TGT GTC TGG 
Leu Cys Thr Arg Asp Ser Glu Cys Cys Gly Asp Gin Leu Cys Val Trp 
180 185 190 

GGT CAC TGC ACC AAA ATG GCC ACC AGG GGC AGC AAT GGG ACC ATC TGT 
Gly His Cys Thr Lys Met Ala Thr Arg Gly Ser Asn Gly Thr He Cys 
195 200 205 



GAC AAC CAG AGG GAC TGC CAG CCG GGG CTG TGC TGT GCC TTC CAG AGA 
Asp Asn Gin Arg Asp Cys Gin Pro Gly Leu Cys Cys Ala Phe Gin Arg 
210 215 220 



364 



432 



480 



528 



576 



624 



672 



720 



768 



GGC CTG CTG TTC CCT GTG TGC ACA CCC CTG CCC GTG GAG GGC GAG CTT 
Gly Leu Leu Phe Pro Val Cys Thr Pro Leu Pro Val Glu Gly Glu Leu 
225 23 ° 235 240 

TGC CAT GAC CCC GCC AGC CGG CTT CTG GAC CTC ATC ACC TGG GAG CTA 
Cys H lS Asp Pro Ala Ser Arg Leu Leu Asp Leu He Thr Trp Glu Leu 
245 250 255 

GAG CCT GAT GGA GCC TTG GAC CGA TGC CCT TGT GCC AGT GGC CTC CTC 816 
Glu Pro Asp Gly Ala Leu Asp Arg Cys Pro Cys Ala Ser Gly Leu Leu 
260 265 270 

TGC CAG CCC CAC AGC CAC AGC CTG GTG TAT GTG TGC AAG CCG ACC TTC 864 
Cys Gin Pro His Ser His Ser Leu Val Tyr Val Cys Lys Pro Thr Phe 
275 280 285 



WO 98/46755 
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-95- 



GTG GGG AGO CGT GAC CAA GAT GGG GAG ATC CTG CTG CCC AGA GAG GTC 
Val Gly Ser Ar 9 Asp Gin Asp Gly Glu He Leu Leu Pro Jrg Val 

5 



912 



295 300 



CCC GAT GAG TAT GAA GTT GGC AGC TTC ATG GAG GAG GTG CGC CAG GAG 960 
Pro Asp Glu Tyr Glu Val Gly Ser Phe Met Glu Glu Val Arg SS Glu 
310 315 320 

10 CTG GAG GAC CTG GAG AGG AGC CTG ACT GAA GAG ATG GCG CTG AGG GAG 1008 
Leu Glu Asp Leu Glu Arg Ser Leu Thr Glu Glu Met Ala Leu A^g tlu 
325 "0 33 5 

15 S aS S !S S 27 ^ f TG ™ 5* AGG ™* «« «T 



Pro Ala Ala Ala Ala Ala Ala Leu Leu Gly Arg Glu Glu He 

350 



340 345 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 base pairs 

(B) TYPE: nucleic acid 

0 (C) STRANDEDNESS : single 

25 (D) TOPOLOGY: linear 



30 



35 



45 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 125.. 796 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GAATTCGGCA CGAGAGACGA CGTGCTGAGC TGCCAGCTTA GTGGAAGCTC TGCTCTGGGT 
GGAGAGCAGC CTCGCTTTGG TGACGCACAG TGCTGGGACC CTCCAGGAGC CCCGGGATTG 



60 
120 

40 GTC CTG CTG GGG CTG AGC ^G CTC TGC TCT CCC 169 

40 Met Val Ala Ala Val Leu Leu Gly Leu Ser Trp Leu Cys Ser Pro 



5 « 15 



*7 CTG GAC TTC MC ^ C ATC AGC TCT GCT GAC 

Leu Gly Ala Leu Val Leu Asp Phe Asn Asn He Arg Ser Ser Ala Is? 
20 25 



30 



CTG CAT GGG GCC CGG AAG GGC TCA CAG TGC CTG TCT GAC ACQ GAC TGC 
Leu hx. Gly Ala Arg Lys Gly Ser Gin Cys Leu Ser Asp J£ Sp Jys 
50 5 40 45 

AAT ACC AGA AAG TTC TGC CTC CAG CCC CGC GAT GAG AAG CCG TTC TGT 
Asn Thr Arg Lys Phe Cys Leu Gin Pro Arg Asp Glu Lys Pro Jhe £s 
50 55 1 



217 



265 



313 



60 
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-96- 

GCT ACA TGT CGT GGG TTG CGG AGG AGG TGC CAG CGA GAT GCC ATG TGC 361 
.Ala Thr Cys Arg Gly Leu Arg Arg Arg Cys Gin Arg Asp Ala Met Cys 
65 70 75 

TGC CCT GGG ACA CTC TGT GTG AAC GAT GTT TGT ACT ACQ ATG GAA GAT 4 09 

Cys Pro Gly Thr Leu Cys val Asn Asp Val Cys Thr Thr Met Glu Asp 
80 85 90 95 

GCA ACC CCA ATA TTA GAA AGG CAG CTT GAT GAG CAA GAT GGC ACA CAT 4 57 

Ala Thr Pro He Leu Glu Arg Gin Leu Asp Glu Gin Asp Gly Thr His 
100 105 no 

GCA GAA GGA ACA ACT GGG CAC CCA GTC CAG GAA AAC CAA CCC AAA AGG 505 
Ala Glu Gly Thr Thr Gly His Pro Val Gin Glu Asn Gin Pro Lys Arg 
115 120 125 

AAG CCA AGT ATT AAG AAA TCA CAA GGC AGG AAG GGA CAA GAG GGA GAA 5 53 

Lys Pro Ser He Lys Lys Ser Gin Gly Arg Lys Gly Gin Glu Gly Glu 
130 135 140 

AGT TGT CTG AGA ACT TTT GAC TGT GGC CCT GGA CTT TGC TGT GCT CGT 601 
Ser Cys Leu Arg Thr Phe Asp Cys Gly Pro Gly Leu Cys Cys Ala Arg 
145 iso 155 

CAT TTT TGG ACG AAA ATT TGT AAG CCA GTC CTT TTG GAG GGA CAG GTC 64 9 

His Phe Trp Thr Lys He Cys Lys Pro Val Leu Leu Glu Gly Gin Val 
160 165 170 175 

TGC TCC AGA AGA GGG CAT AAA GAC ACT GCT CAA GCT CCA GAA ATC TTC 6 97 

Cys Ser Arg Arg Gly His Lys Asp Thr Ala Gin Ala Pro Glu He Phe 
180 185 190 

CAG CGT TGC GAC TGT GGC CCT GGA CTA CTG TGT CGA AGC CAA TTG ACC 74 5 

Gin Arg Cys Asp Cys Gly Pro Gly Leu Leu Cys Arg Ser Gin Leu Thr 
195 200 205 

AGC AAT CGG CAG CAT GCT CGA TTA AGA GTA TGC CAA AAA ATA GAA AAG 793 
Ser Asn Arg Gin His Ala Arg Leu Arg Val Cys Gin Lys He Glu Lys 
210 215 220 

CTA TAAATATTTC AAAATAAAGA AGAATCCACA TTGCAAAAAA AAAAAAAAAA AA 848 
Leu 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



WO 98/46755 
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-97- 

Met val Ala Ala Val Leu Leu Gly Leu Ser Trp Leu Cys Ser Pro Leu 
5 10 15 

Gly Ala Leu Val Leu Asp Phe Asn Asn lie Arg Ser Ser Ala Asp Leu 
20 25 30 

His Gly Ala Arg Lys Gly Ser Gin Cys Leu Ser Asp Thr Asp Cys Asn 

40 45 

10 Thr Arg Lys Phe Cys Leu Gin Pro Arg Asp Glu Lys Pro Phe Cys Ala 
50 55 60 

Thr Cys Arg Gly Leu Arg Arg Arg Cys Gin Arg Asp Ala Met Cys Cys 
15 70 75 80 

Pro Gly Thr Leu Cys Val Asn Asp Val Cys Thr Thr Met Glu Asp Ala 



85 9° 95 



20 ^ ^" G1U Arg Gln Leu As P G1 " Gin Asp Gly Thr His Ala 

100 105 110 

Glu Gly Thr Thr Gly His Pro Val Gin Glu Asn Gin Pro Lys Arg Lys 
115 120 125 

25 Pro Ser lie Lys Lys Ser Gin Gly Arg Lys Gly Gin Glu Gly Glu Ser 



35 



135 140 
Cys Leu Arg Thr Phe Asp Cys Gly Pro Gly Leu Cys Cys Ala Arg His 
30 150 155 160 

Phe Trp Thr Lys He Cys Lys Pro Val Leu Leu Glu Gly Gin Val Cys 
165 170 1?5 

Ser Arg Arg Gly His Lys Asp Thr Ala Gin Ala Pro Glu He Phe Gin 
180 185 190 

Arg Cys Asp Cys Gly Pro Gly Leu Leu Cys Arg Ser Gin Leu Thr Ser 



195 200 



205 



40 Asn Arg Gin His Ala Arg Leu Arg Val Cys Gin Lys He Glu Lys Leu 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



10 



WO 98/46755 PCI7US98/07894 

-98- 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

ATG GTG GCG GCC GTC CTG CTG GGG CTG AGC TGG CTC TGC TCT CCC CTG 
Met val Ala Ala Val Leu Leu Gly Leu Ser Trp Leu Cys Ser Pro Leu 
15 10 15 

GGA GCT CTG GTC CTG GAC TTC AAC AAC ATC AGG AGC TCT GCT GAC CTG 
Gly Ala Leu Val Leu Asp Phe Asn Asn He Arg Ser Ser Ala Asp Leu 
20 25 30 



50 



48 



96 



144 



192 



240 



28S 



336 



15 CAT GGG GCC CGG AAG GGC TCA CAG TGC CTG TCT GAC ACG GAC TGC AAT 
His Gly Ala Arg Lys Gly Ser Gin Cys Leu Ser Asp Thr Asp Cys Asn 
35 40 45 

ACC AGA AAG TTC TGC CTC CAG CCC CGC GAT GAG AAG CCG TTC TGT GCT 
20 Thr Arg Lys Phe Cys Leu Gin Pro Arg Asp Glu Lys Pro Phe Cys Ala 
SO 55 6° 

ACA TGT CGT GGG TTG CGG AGG AGG TGC CAG CGA GAT GCC ATG TGC TGC 
Thr cys Arg Gly Leu Arg Arg Arg Cys Gin Arg Asp Ala Met Cys Cys 
25 65 ' 70 75 80 

CCT GGG ACA CTC TGT GTG AAC GAT GTT TGT ACT ACG ATG GAA GAT GGA 
Pro Gly Thr Leu Cys Val Asn Asp Val Cys Thr Thr Met Glu Asp Ala 
85 90 95 

30 

ACC CCA ATA TTA GAA AGG CAG CTT GAT GAG CAA GAT GGC ACA CAT GCA 
Thr Pro He Leu Glu Arg Gin Leu Asp Glu Gin Asp Gly Thr Hxs Ala 
100 105 HO 

35 GAA GGA ACA ACT GGG CAC CCA GTC CAG GAA AAC CAA CCC AAA AGG AAG 
Glu Gly Thr Thr Gly His Pro Val Gin Glu Asn Gin Pro Lys Arg Lys 
US 120 I 25 

CCA AGT ATT AAG AAA TCA CAA GGC AGG AAG GGA CAA GAG GGA GAA AGT 
40 Pro Ser He Lys Lys Ser Gin Gly Arg Lys Gly Gin Glu Gly Glu Ser 
130 "5 140 

TGT CTG AGA ACT TTT GAC TGT GGC CCT GGA CTT TGC TGT GCT CGT CAT 
Cys Leu Arg Thr Phe Asp Cys Gly Pro Gly Leu Cys Cys Ala Arg His 
45 i« 150 I" 160 

TTT TGG ACG AAA ATT TGT AAG CCA GTC CTT TTG GAG GGA CAG GTC TGC 528 
Phe Trp Thr Lys He Cys Lys Pro Val Leu Leu Glu Gly Gin Val Cys 
165 l -70 175 



384 



432 



480 



TCC AGA AGA GGG CAT AAA GAC ACT GCT CAA GCT CCA GAA ATC TTC CAG 576 
Ser Arg Arg Gly His Lys Asp Thr Ala Gin Ala Pro Glu He Phe Gin 
180 185 190 
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-99- 

CGT TGC GAC TGT GGC CCT GGA CTA CTG TGT CGA AGC CAA TTG ACC AGC 624 
Arg Cys Asp Cys Gly Pro Gly Leu Leu Cys Arg Ser Gin Leu Thr Ser 
195 200 205 

5 AAT CGG CAG CAT GCT CGA TTA AGA GTA TGC CAA AAA ATA GAA AAG CTA 672 
Asn Arg Gin His Ala Arg Leu Arg Val Cys Gin Lys He Glu Lys Leu 
210 215 220 



10 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

20 <ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 93.. 890 



25 



35 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CCGGACGCGT GGGCGGCACG GTTTCGTGGG GACCCAGGCT TGCAAAGTGA CGGTCATTTT 60 



CTCTTTCTTT CTCCCTCTTG AGTCCTTCTG AG ATG ATG GCT CTG GGC GCA GCG 113 

Met Met Ala Leu Gly Ala Ala 
30 1 5 

GGA GCT ACC CGG GTC TTT GTC GCG ATG GTA GCG GCG GCT CTC GGC GGC 161 
Gly Ala Thr Arg Val Phe Val Ala Met Val Ala Ala Ala Leu Gly Gly 
10 15 20 



CAC CCT CTG CTG GGA GTG AGC GCC ACC TTG AAC TCG GTT CTC AAT TCC 209 
His Pro Leu Leu Gly Val Ser Ala Thr Leu Asn Ser Val Leu Asn Ser 
25 30 35 



40 AAC GCT ATC AAG AAC CTG CCC CCA CCG CTG GGC GGC GCT GCG GGG CAC 2 57 

Asn Ala lie Lys Asn Leu Pro Pro Pro Leu Gly Gly Ala Ala Gly His 
40 45 50 55 

CCA GGC TCT GCA GTC AGC GCC GCG CCG GGA ATC CTG TAC CCG GGC GGG 305 
45 Pro Gly Ser Ala Val Ser Ala Ala Pro Gly He Leu Tyr Pro Gly Gly 

60 65 70 

AAT AAG TAC CAG ACC ATT GAC AAC TAC CAG CCG TAC CCG TGC GCA GAG 353 
Asn Lys Tyr Gin Thr He Asp Asn Tyr Gin Pro Tyr Pro Cys Ala Glu 
50 " 75 80 85 

GAC GAG GAG TGC GGC ACT GAT GAG TAC TGC GCT AGT CCC ACC CGC GGA 401 

Asp Glu Glu Cys Gly Thr Asp Glu Tyr Cys Ala Ser Pro Thr Arg Gly 
90 95 100 

55 



WO 98/46755 PCT/US98/07894 



- 100- 

GGG GAC GCA GGC GTG CAA ATC TGT CTC GCC TGC AGG AAG CGC CGA AAA 449 
Gly Asp Ala Gly Val Gin lie Cys Leu Ala Cys Arg Lys Arg Arg Lys 
105 110 115 

5 CGC TGC ATG CGT CAC GCT ATG TGC TGC CCC GGG AAT TAC TGC AAA AAT 4 97 

Arg Cys Met Arg His Ala Met Cys Cys Pro Gly Asn Tyr Cys Lys Asn 
120 125 130 135 

GGA ATA TGC GTG TCT TCT GAT CAA AAT CAT TTC CGA GGA GAA ATT GAG 545 
10 Gly He Cys Val Ser Ser Asp Gin Asn His Phe Arg Gly Glu He Glu 
140 145 150 

GAA ACC ATC ACT GAA AGC TTT GGT AAT GAT CAT AGC ACC TTG GAT GGG 5 93 

Glu Thr He Thr Glu Ser Phe Gly Asn Asp His Ser Thr Leu Asp Gly 
15 155 160 165 

TAT TCC AGA AGA ACC ACC TTG TCT TCA AAA ATG TAT CAC ACC AAA GGA 641 
Tyr Ser Arg Arg Thr Thr Leu Ser Ser Lys Met Tyr His Thr Lys Gly 
170 175 180 

20 

CAA GAA GGT TCT GTT TGT CTC CGG TCA TCA GAC TGT GCC TCA GGA TTG 68 9 

Gin Glu Gly Ser Val Cys Leu Arg Ser Ser Asp Cys Ala Ser Gly Leu 
185 190 195 

25 TGT TGT GCT AGA CAC TTC TGG TCC AAG ATC TGT AAA CCT GTC CTG AAA 73 7 

Cys Cys Ala Arg His Phe Trp Ser Lys He Cys Lys Pro Val Leu Lys 
200 205 210 215 

GAA GGT CAA GTG TGT ACC AAG CAT AGG AGA AAA GGC TCT CAT GGA CTA 785 
30 Glu Gly Gin Val Cys Thr Lys His Arg Arg Lys Gly Ser His Gly Leu 

220 225 230 

GAA ATA TTC CAG CGT TGT TAC TGT GGA GAA GGT CTG TCT TGC CGG ATA 833 
Glu He Phe Gin Arg Cys Tyr Cys Gly Glu Gly Leu Ser Cys Arg He 
35 235 240 245 

CAG AAA GAT CAC CAT CAA GCC AGT AAT TCT TCT AGG CTT CAC ACT TGT 881 
Gin Lys Asp His His Gin Ala Ser Asn Ser Ser Arg Leu His Thr Cys 
250 255 260 

40 

CAG AGA CAC TAAAC CAGCT ATCCAAAATG CAGTGAACTC CTTTTATATA 930 
Gin Arg His 
265 

45 ATAGATGCTA TGAAAACCTT TTATGACCTT CATCAACTCA ATCCTAAGGA TATACAAGTT 990 
CTGTGGTTTC AGTTAAGCAT TCCAATAACA CCTTCCAAAA ACCTGGAGTG T AAGAGC TTT 1050 
GTTTCTTTAT GGAACTCCCC TGTGATTGCA GTAAATTACT GTATTGTAAA TTCTCAGTGT 1110 

50 

GGCACTTACC TGTAAATGCA ATGAAACTTT TAATTATTTT TCTAAAGGTG CTGCACTGCC 1170 
TATTTTTCCT CTTGTTATGT AAATTTTTGT ACACATTGAT TGTTAT CTTG ACTGACAAAT 1230 



55 ATTCTATATT GAACTGAAGT AAATCATTTC AGCTTATAGT TCTTAAAAGC ATAACCCTTT 1290 
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ACCCCATTTN ATTCTAGAGT CNAGAACGCA AGGATCTCTT GGAATGACAA ATGATAGGTA 1350 
5 C C TAAAATGT AACATGAAAA TACTAGCTTA TTTTCTGAAA TGTACTATCT TAATGCTTAA 1410 
ATTATATTTC CCTTTAGGCT GTGATAGTTT TTGAAATAAA ATTTAACATT TAATATCATG 1470 
AAATG KT AT A AGTAGACATA AAAAAAAAAA AAAAAAAAAA AGGGCGGCCG CTAGACTAG 1529 

10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (*i) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Met Ala Leu Gly Ala Ala Gly Ala Thr Arg Val Phe Val Ala Met 
1 . 5 10 15 

25 Val Ala Ala Ala Leu Gly Gly His Pro Leu Leu Gly Val Ser Ala Thr 
20 25 30 

Leu Asn Ser Val Leu Asn Ser Asn Ala lie Lys Asn Leu Pro Pro Pro 

30 

Leu Gly Gly Ala Ala Gly His Pro Gly Ser Ala Val Ser Ala Ala Pro 
50 55 60 

Gly lie Leu Tyr Pro Gly Gly Asn Lys Tyr Gin Thr He Asp Asn Tyr 

° 75 80 

Gin Pro Tyr Pro Cys Ala Glu Asp Glu Glu Cya Gly Thr Asp Glu Tyr 
85 90 95 

40 Cys Ala Ser Pro Thr Arg Gly Gly Asp Ala Gly Val Gin lie Cys Leu 
100 105 no 

Ala Cys Arg Lys Arg Arg Lys Arg Cys Met Arg His Ala Met Cys Cys 
45 115 120 125 

Pro Gly Asn Tyr Cys Lys Asn Gly He Cys Val Ser Ser Asp Gin Asn 
130 135 i 4 o 

His Phe Arg Gly Glu He Glu Glu Thr He Thr Glu Ser Phe Gly Asn 
50 145 «0 155 i 6 o 

Asp His Ser Thr Leu Asp Gly Tyr Ser Arg Arg Thr Thr Leu Ser Ser 
16 5 170 175 
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Lys Met Tyr His Thr Lys Gly Gin Glu Gly Ser Val Cys Leu Arg Ser 

165 19Q 

5 Ser Asp cys Ala Ser Gly Leu Cys Cys Ala Arg His Phe Trp Ser Lys 

200 205 

He Cys Lys Pro Val Leu Lys Glu Gly Gin Val Cys Thr Lys His Arg 

215 220 
10 Arg Lys Gly Ser His Gly Leu Glu He Pne Gin Arg Cys Tyr Cys Gly 



235 



240 



Glu Gly Leu Ser Cys Arg He Gin Lys Asp His His Gin Ala Ser Asn 
15 250 255 

Ser Ser Arg Leu His Thr Cys Gin Arg His 



260 265 



20 



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 
?s < c > S TRANDEDNES S : single 
° <D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..798 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:9 : 



35 SS = =522; =S -- = - 2 - 

5 10 15 

<« £ E E E = E E E E = - - - - - - 

A U o c 



25 30 



45 



= = = S = = = = S X Z E 2 E £ - 



40 45 



= S 5 E = E E 2 5 - S S S E E E 

50 55 co 

Glv ill 7° GGG **» TAC CM ACC ATT GAC AAC TAC 

Gly He Leu Tyr Pro Gly Gly Asn Lys Tyr Gin Thr xl l £ ™ 



48 



96 



144 



192 



240 



80 
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- 103 - 

CAG CCG TAC CCG TGC GCA GAG GAC GAG GAG TGC GGC ACT GAT GAG TAC 288 
Gin Pro Tyr Pro Cys Ala Glu Asp Glu Glu Cys Gly Thr Asp Glu Tyr 
85 90 95 

TGC GCT AGT CCC ACC CGC GGA GGG GAC GCA GGC GTG CAA ATC TGT CTC 3 36 

Cys Ala Ser Pro Thr Arg Gly Gly Asp Ala Gly Val Gin He Cys Leu 
100 105 no 

GCC TGC AGG AAG CGC CGA AAA CGC TGC ATG CGT CAC GCT ATG TGC TGC 384 
Ala Cys Arg Lys Arg Arg Lys Arg Cys Met Arg His Ala Met Cys Cys 
115 120 125 

CCC GGG AAT TAC TGC AAA AAT GGA ATA TGC GTG TCT TCT GAT CAA AAT 4 32 

Pro Gly Asn Tyr Cys Lys Asn Gly He Cys Val Ser Ser Asp Gin Asn 
130 135 i4 0 

CAT TTC CGA GGA GAA ATT GAG GAA ACC ATC ACT GAA AGC TTT GGT AAT 4 80 

His Phe Arg Gly Glu He Glu Glu Thr He Thr Glu Ser Phe Gly Asn 
145 150 155 160 

GAT CAT AGC ACC TTG GAT GGG TAT TCC AGA AGA ACC ACC TTG TCT TCA 52 8 

Asp His Ser Thr Leu Asp Gly Tyr Ser Arg Arg Thr Thr Leu Ser Ser 
165 170 175 

AAA ATG TAT CAC ACC AAA GGA CAA GAA GGT TCT GTT TGT CTC CGG TCA 576 
Lys Met Tyr His Thr Lys Gly Gin Glu Gly Ser Val Cys Leu Arg Ser 
180 185 190 

TCA GAC TGT GCC TCA GGA TTG TGT TGT GCT AGA CAC TTC TGG TCC AAG 624 
Ser Asp Cys Ala Ser Gly Leu Cys Cys Ala Arg His Phe Trp Ser Lys 
195 200 205 

ATC TGT AAA CCT GTC CTG AAA GAA GGT CAA GTG TGT ACC AAG CAT AGG 6 72 

He Cys Lys Pro Val Leu Lys Glu Gly Gin Val Cys Thr Lys His Arg 
210 215 220 

AGA AAA GGC TCT CAT GGA CTA GAA ATA TTC CAG CGT TGT TAC TGT GGA 72 0 

Arg Lys Gly Ser His Gly Leu Glu He Phe Gin Arg Cys Tyr Cys Gly 
225 230 235 240 

GAA GGT CTG TCT TGC CGG ATA CAG AAA GAT CAC CAT CAA GCC AGT AAT 768 
Glu Gly Leu Ser Cys Arg He Gin Lys Asp His His Gin Ala Ser Asn 
245 250 255 

TCT TCT AGG CTT CAC ACT TGT CAG AGA CAC 7 98 

Ser Ser Arg Leu His Thr Cys Gin Arg His 
260 265 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



WO 98/46755 
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104- 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(XX) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .53 7 



iQ (Xi) SEQUENCE DESCRIPTION: SEQ ID NO* 10* 

= = Z S 5 = 5 - - 5 - - - « « - 

,5 = = = s = = - - 5 s - - - - =: - 

» 5 s 2 =s s s s s s s z z z s s: s 

25 = S = = = S = s - - - - 2 - - - 

55 60 

£ 5 2 S £ S « ~ •» « « TCA « „, Mc TO 

65 P f? Asn Hls Hls Tyr Ser Asn His Asp Leu 

30 70 75 80 

5 5 = = = S S £ = s - - - S - « 
35 5 = = S 5 = - Si S = = - - s £ - 

105 110 

• = £ 5 E JS s = - - - - - - - - s 

120 125 
CAT CAG GGG GAA GTC TGT arn aa* ~ 
, His Gin Gly Glu Val Ss TOr GGT TCT CAT ^ 

45 130 ^ ™* LyS Gln Ar 9 L ^ s Lys Gly ser His Gly 

1 ~ i5 140 

S SI S S £ £ ~ « » « «« « CTG TCT TGC AAA 
145 15 J yS Asp Cys Ala Gly Leu Ser Cys Lys 

>0 155 16Q 

s s £ Asp e s Tyr e s e e a^ ctc « « - 

16S y la Arg Leu His Val Cys 



46 



96 



144 



192 



240 



268 



336 



384 



432 



480 



528 



170 175 



WO 98/46755 

PCT/US98/07894 

-105- 

S £ £ -> — » « 
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TATAGAT GATCACAAAA AAAAAAAAAA AAAAGATGCG SS7 



10 

<2) INFORMATI ^fOR SEQlDNO:11: 

|g (i) S E QUE NCE CHARACTERISTICS • 

TYPE: ai "ino acid 
topology.- ii„ ear 

<ii) MOLECULE TYPE : p rotein 

20 

SEQUENCE D ^RIP TION: SEQiDno . u . 
ASP M ^ Cys Cys Pro Ser T . 30 

30 

Pro Val xhr Glu Ser n e , 45 
50 n e Leu Thr pro 

55 116 P *° Ala Leu Asp Qly 

Thr Arg His Arg Asn av-„ » 
35 65 «. *. „ a Ht . My ^ ^ ^ ^ ^ 

Gly Ttp G ln »,„ Leu so 

» *■ °*° m ° «" - - «. n. 

oiy hi * ° iu s - - - - - - - ., P c, IU 

105 y Ile G1 U Gly 

Phe Cys Cvs a la , 110 
y ys AIa Arg His Ph^ mu 

45 115 S ly * "« <*■ v« 

- J; «, «. v„ cys " ! 

a35 9 LyS ^ s °ly Ser His Ql y 

50 JS ^ G1 * Cys Asp cys Ala Lys 

™ rrp Lys Asp Ala Thr ^ Ser ^ 

^ To Ma tSU Hi * Val cys 

5 ^ Gin Lys n e 1? 5 



702 
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10 



15 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1. .537 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

» = S 5 S 5 S 5 5 £ £ = s S SK £ 



5 10 is 



25 



ler ser 2* ™ ^ GTG T ° T CGG AGA AAG AAG CGC TGC CAC CGA 

Ser Ser Ala Cys Met Val Cys Arg Ar g Lys Lys Lys Arg Cys ™ £J 



25 



30 



IZ Tlv S CGC TGC *** ^ T «* ATC TGT ATC 

Asp Gly Met Cys Cys Pro Ser Thr Arg Cys Asn Asn Gly He Cys He 

30 40 45 

P™ ^ T ^ AGC ATC TTA ACC CCT CAC ATC CCG GCT CTG GAT GGT 

Pro Val Thr Glu Ser He Leu Thr Pro His He Pro All 2 2p Sy 



60 



35 S S ^ f A ^ CGA CAC GGT <* T TAC TCA AAC CAT GAC TTG 

Thr Arg Hxs Arg Asp Arg Asn His Gly His Tyr Ser Asn His Asp ™ 

70 75 



80 



40 



45 



Tlv S S° f T CTA GGA AGA CCA CAC ACT **» ATG TCA CAT ATA AAA 
Gly Trp Gin Asn Leu Gly Arg Pro His Thr Lys Met Ser His He J£ 



85 90 95 



5 = = 5 5S = 5 2 S = 2 5 5 s « - 

100 105 



110 



= 5 5 S 5 His £ 5 £ £ S 5 S S S = 

50 120 125 

= z s; s = s: s s s: s 5 = s = = 5 



48 



96 



144 



192 



240 



288 



336 



384 



432 



20 
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CTG GAA ATT TTC CAG CGT TGC GAC TGT GCG AAG GGC CTG TCT TGC AAA 480 

Leu Glu lie Phe Gin Arg Cys Asp Cys Ala Lys Gly Leu Ser Cys Lys 
145 150 155 160 



5 GTA TGG AAA GAT GCC ACC TAC TCC TCC AAA GCC AGA CTC CAT GTG TGT 
val Trp Lys Asp Ala Thr Tyr Ser Ser Lys Ala Arg Leu His Val Cys 



1*5 170 



175 



CAG AAA ATT 
10 Gin Lys lie 



(2) INFORMATION FOR SEQ ID NO:13: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



50 55 



60 



50 



528 



537 



(lx) FEATURE: 

(A) NAME /KEY : CDS 
25 (B) LOCATION: 75.. 800 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

^ CTCGAGGCCA AAATTCGGCA CGAGGCCGGG CTGTGGTCTA GCATAAAGGC GGAGCCCAGA 60 

AGAAGGGGCG GGGT ATG GGA GAA GCC TCC CCA CCT GCC CCC GCA AGG CGG 110 
Met Gly Glu Ala Ser Pro Pro Ala Pro Ala Arg Arg 
15 10 

35 CAT CTG CTG GTC CTG CTG CTG CTC CTC TCT ACC CTG GTG ATC CCC TCC 158 
His Leu Leu Val Leu Leu Leu Leu Leu Ser Thr Leu Val lie Pro Ser 
15 20 25 

GCT GCA GCT CCT ATC CAT GAT GCT GAC GCC CAA GAG AGC TCC TTG GGT 206 
4U Ala Ala Ala Pro He His Asp Ala Asp Ala Gin Glu Ser Ser Leu Gly 
30 35 40 

CTC ACA GGC CTC CAG AGC CTA CTC CAA GGC TTC AGC CGA CTT TTC CTG 254 

45 L 45 L6U Gln Ser LeU LSU Gln Gly Phe Ser Arg Leu phe Leu 



AAA GGT AAC CTG CTT CGG GGC ATA GAC AGC TTA TTC TCT GCC CCC ATG 302 
Lys Gly Asn Leu Leu Arg Gly He Asp Ser Leu Phe Ser Ala Pro Met 
65 70 75 

GAC TTC CGG GGC CTC CCT GGG AAC TAC CAC AAA GAG GAG AAC CAG GAG 350 
Asp Phe Arg Gly Leu Pro Gly Asn Tyr His Lys Glu Glu Asn Gln Glu 
80 85 90 
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CAC CAG CTG GGG AAC AAC ACC CTC TCC AGC CAC CTC CAG ATC GAC AAr 
Hi. Gin Leu Gly Asn Asn Thr Leu Ser Ser His Leu Gin lit 2p 
95 ioo 105 

ATG ACC GAC AAC AAG ACA GGA GAG GTG CTG ATC TCC GAG AAT GTG GTG 
Met Thr Asp Asn Lys Thr Gly Glu Val Leu He Ser Glu £n £? vll 

115 120 



210 215 



220 



35 



40 



45 



50 



55 



TAGCCCCCAT CAGACCCTGC CCCAAGCACC ATATGGAAAT AAAGTTCTTT C TTACATCTA 
AAAAAAAAAA AAAAAAAAAA AAAAAAATTG GCGGCCGC 

(2) INFORMATION FOR SEQ ID NO; 14; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



398 



446 



130 "5 140 

CCC AGG ATG GAG GAG AAG GAG GCC CTG GTA CCC ATC CAG AAG GCC ACG 
Pro Arg Met Glu Glu Lys Glu Ala Leu Val Pro He o£ £s S S 
145 150 155 

GAC AGC TTC CAC ACA GAA CTC CAT CCC CGG GTG GCC TTC TGG ATC ATT 
Asp ser Phe His Thr Glu Leu His Pro Arg Val Ala Phe Trp S S 
20 165 170 

AAG CTG CCA CGG CGG AGG TCC CAC CAG GAT GCC CTG GAG GGC GGC CAP 
Lye Leu Pro Arg Arg Arg Ser His Gin Asp Ala Leu Oil Gly Gly Ss 
175 180 185 

^ So S° f ° CGA CAC CGC CTG CAG GCC ATC CGG GAT GGA CTC 

Trp Leu Ser Glu Lys Arg His Arg Leu Gin Ala He Arg Asp Gly 2u 

195 200 

30 5 K s S £ E 5 s = s: S S S-2 = S '« 



542 



590 



638 



686 



782 



TCC CAC TCC AGG CTG TCC CCC CGA AAG ACC CAC TTA CTG TAC ATC CTC 
Ser His Ser Arg Leu Ser Pro Arg Lys Thr His Leu Leu ?yr lie Leu 
225 230 

5 £ S T e C r T Arg G "n 2u 
240 



890 
928 
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M " oly Giu si * s - - - - •» - - - «. « 

» - - - u. s „ ^ ^ „ n . pro ser ^ ai> ^ 
w »• «. 1? «. ^ Ma G1 „ „. s „ s „ wu oiy ^ u oiy ^ 

- - - _ Mo aly ^ s . r ^ ^ pte ^ j; ^ ^ ^ 

60 

15 Is Gly Ile A *P Ser Leu Phe Ser A la Pro Met , 

5 70 Met Phe Arg Gly 

Leu Pro Glv Asn t,,» «• 80 

« L " ° lu 01 " "5 - «■ «. «. u. „ y 

25 - - s «. V>1 ^ „. olu am vu w Ma - 1= 

125 



SI S " ^ «« «y Asp Leu Lys Val 



, «, u uys val p ro Arg |fct ^ 



30 LyS Glu Ala Leu val 



140 



« „ Pto ne G1 „ Ly , Ua ^ ^ ^ ^ 

- - u. s ^ M ua ph . s m Lys l=u pre ~ 

« - - ~ «. u , L . u s oly oly ais itp ^ ™ mu 
„ Ly * s 419 * u oi ° - a - - - - - - - 

205 

~ S ^ v,, tau „„ i Gly fc ^ ^ ^ 

220 



45 ^ LyS Thr ^u Leu Ty r Ile Leu , 

230 Y t le Leu Ar ST Pro Ser Arg 

235 9 

Gin Leu 240 



50 



(2) INFORMATION FOR SEQ ID NO:15; 



<i> SEQUENCE CHARACTERISTICS • 

(A) LENGTH : 726 base pairs 
55 ! nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATG GGA GAA GCC TCC CCA CCT GCC CCC GCA AGG CGG CAT CTG CTG GTC 48 
Met Gly Glu Ala Ser Pro Pro Ala Pro Ala Arg Arg His Leu Leu Val 
1 5 10 15 



CTG CTG CTG CTC CTC TCT ACC CTG GTG ATC CCC TCC GCT GCA GCT CCT 
Leu Leu Leu Leu Leu Ser Thr Leu Val lie Pro Ser Ala Ala Ala Pro 
20 25 30 



96 



ATC CAT GAT GCT GAC GCC CAA GAG AGC TCC TTG GGT CTC ACA GGC CTC 144 
lie His Asp Ala Asp Ala Gin Glu Ser Ser Leu Gly Leu Thr Gly Leu 
3 5 40 45 

CAG AGC CTA CTC CAA GGC TTC AGC CGA CTT TTC CTG AAA GGT AAC CTG 192 
Gin Ser Leu Leu Gin Gly Phe Ser Arg Leu Phe Leu Lys Gly Asn Leu 
50 55 60 

CTT CGG GGC ATA GAC AGC TTA TTC TCT GCC CCC ATG GAC TTC CGG GGC 240 
Leu Arg Gly He Asp Ser Leu Phe Ser Ala Pro Met Asp Phe Arg Gly 
65 70 75 " 80 

CTC CCT GGG AAC TAC CAC AAA GAG GAG AAC CAG GAG CAC CAG CTG GGG 288 
Leu Pro Gly Asn Tyr His Lys Glu Glu Asn Gin Glu His Gin Leu Gly 
S5 90 95 

AAC AAC ACC CTC TCC AGC CAC CTC CAG ATC GAC AAG ATG ACC GAC AAC 336 
Asn Asn Thr Leu Ser Ser His Leu Gin He Asp Lys Met Thr Asp Asn 
100 105 no 

AAG ACA GGA GAG GTG CTG ATC TCC GAG AAT GTG GTG GCA TCC ATT CAA 384 
Lys Thr Gly Glu Val Leu He Ser Glu Asn Val Val Ala Ser He Gin 
115 120 125 

CCA GCG GAG GGG AGC TTC GAG GGT GAT TTG AAG GTA CCC AGG ATG GAG 432 
Pro Ala Glu Gly Ser Phe Glu Gly Asp Leu Lys Val Pro Arg Met Glu 
130 135 140 

GAG AAG GAG GCC CTG GTA CCC ATC CAG AAG GCC ACG GAC AGC TTC CAC 480 
Glu Lys Glu Ala Leu Val Pro He Gin Lys Ala Thr Asp Ser Phe His 
145 150 155 160 

ACA GAA CTC CAT CCC CGG GTG GCC TTC TGG ATC ATT AAG CTG CCA CGG 528 
Thr Glu Leu His Pro Arg Val Ala Phe Trp He He Lys Leu Pro Arg 
165 170 175 
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CGG AGG TCC CAC CAG GAT GCC CTG GAG GGC GGC 
Arg Arg Ser His Gin Asp Ala Leu Glu Gly Gly 
180 1B5 

5 AAG CGA CAC CGC CTG CAG GCC ATC CGG GAT GGA 
Lys Arg His Arg Leu Gin Ala lie Arg Asp Gly 
195 200 

CAC AAG GAC GTC CTA GAA GAG GGG ACC GAG AGC 
IV His Lys Asp Val Leu Glu Glu Gly Thr Glu Ser 
21 ° 215 

CTG TCC CCC CGA AAG ACC CAC TTA CTG TAC ATC 
Leu Ser Pro Arg Lys Thr His Leu Leu Tyr lie 
° 225 230 235 

CAG CTG 
Gin Leu 



CAC TGG CTC AGC GAG 
His Trp Leu Ser Glu 
190 

CTC CGC AAG GGG ACC 
Leu Arg Lys Gly Thr 
205 

TCC TCC CAC TCC AGG 
Ser Ser His Ser Arg 
220 

CTC AGG CCC TCT CGG 
Leu Arg Pro Ser Arg 
240 



20 



25 



30 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 16: 

. (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 110.. 1156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
FGTCGACCCA CGCGTCCGCT GTGGCAGCCC AGCTACCGGT 
AGCTCAGCTT TGTTCATTCG AATTGGGCGG CGGCCAGCGC 



CGG CTC GGG GOT ATT TTG CTG TGT ACA CTG CTG 
Arg Leu Gly Gly He Leu Leu Cys Thr Leu Leu 
45 5 10 



50 



ACT GCT CCT GCT CCT TCC CCG ACG GTC ACT TGG 
Thr Ala Pro Ala Pro Ser Pro Thr Val Thr Trp 
20 25 

GGC CCA GCT CTC AAC TAC CCT CAG GAG GAA GCT 
Gly Pro Ala Leu Asn Tyr Pro Gin Glu Glu Ala 
35 40 45 



CGTGACCAGA TCCAGCTTGC 

GGAACAAAC ATG CAG 
Met Gin 
1 

GCG GCG GCG GTC CCC 
Ala Ala Ala Val Pro 
15 

ACT CCG GCG GAG CCG 
Thr Pro Ala Glu Pro 
30 

ACG CTC AAT GAG ATG 
Thr Leu Asn Glu Met 
50 



576 



624 



672 



720 



726 



60 
115 

163 

211 

259 



WO 98/46755 



PCT/US98/07894 



- 112- 



TTT CGA GAG GTG GAG GAG CTG ATG GAA GAC ACT CAG CAC AAA CTG CGC 
Phe Arg Glu Val Glu Glu Leu Met Glu Asp Thr Gin His Lys Leu Arg 
55 60 * 65 

AGT GCC GTG GAG GAG ATG GAG GCG GAA GAA GCA GCT GCT AAA ACG TCC 
Ser Ala Val Glu Glu Met Glu Ala Glu Glu Ala Ala Ala Lys Thr Ser 
70 75 80 

TCT GAG GTG AAC CTG GCA AGC TTA CCT CCC AAC TAT CAC AAT GAG ACC 
Ser Glu Val Asn Leu Ala Ser Leu Pro Pro Asn Tyr His Asn Glu Thr 
85 90 95 

AGC ACG GAG ACC AGG GTG GGA AAT AAC ACA GTC CAT GTG CAC CAG GAA 
Ser Thr Glu Thr Arg Val Gly Asn Asn Thr Val His Val His Gin Glu 
100 105 no 

GTT CAC AAG ATA ACC AAC AAC CAG AGT GGA CAG GTG GTC TTT TCT GAG 
Val His Lys He Thr Asn Asn Gin Ser Gly Gin Val Val Phe Ser Glu 
115 120 125 130 

ACA GTC ATT ACA TCT GTA GGG GAT GAA GAA GGC AAG AGG AGC CAT GAA 
Thr Val He Thr Ser Val Gly Asp Glu Glu Gly Lys Arg Ser His Glu 
!35 140 145 

TGT ATC ATT GAT GAA GAC TGT GGG CCC ACC AGG TAC TGC CAG TTC TCC 
Cys He He Asp Glu Asp Cys Gly Pro Thr Arg Tyr Cys Gin Phe Ser 
!50 155 " i 6 o 

AGC TTC AAG TAC ACC TGC CAG CCA TGC CGG GAC CAG CAG ATG CTA TGC 
Ser Phe Lys Tyr Thr Cys Gin Pro Cys Arg Asp Gin Gin Met Leu Cys 
165 170 175 

ACC CGA GAC AGT GAG TGC TGT GGA GAC CAG CTG TGT GCC TGG GGT CAC 
Thr Arg Asp Ser Glu Cys Cys Gly Asp Gin Leu Cys Ala Trp Gly His 
180 185 190 

TGC ACC CAA AAG GCC ACC AAA GGT GGC AAT GGG ACC ATC TGT GAC AAC 
Cys Thr Gin Lys Ala Thr Lys Gly Gly Asn Gly Thr He Cys Asp Asn 
195 200 



205 210 



CAG AGG GAT TGC CAG CCT GGC CTG TGT TGT GCC TTC CAA AGA GGC CTG 
Gin Arg Asp Cys Gin Pro Gly Leu Cys Cys Ala .Phe Gin Arg Gly Leu 



215 220 



225 



CTG TTC CCC GTG TGC ACA CCC CTG CCC GTG GAG GGA GAG CTC TGC CAT 
Leu Phe Pro Val Cys Thr Pro Leu Pro Val Glu Gly Glu Leu Cys His 



230 235 



240 



307 



355 



403 



451 



499 



547 



595 



643 



691 



739 



787 



835 



GAC CCC ACC AGC CAG CTG CTG GAT CTC ATC ACC TGG GAA CTG GAG CCT 883 
Asp Pro Thr Ser Gin Leu Leu Asp Leu He Thr Trp Glu Leu Glu Pro 
245 250 " 255 

GAA GGA GCT TTG GAC CGA TGC CCC TGC GCC AGT GGC CTC CTA TGC CAG 931 
Glu Gly Ala Leu Asp Arg Cys Pro Cys Ala Ser Gly Leu Leu Cys Gin 
26 ° 265 270 
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1027 



1075 



1123 



CCA CAC AGC CAC AGT CTG GTG TAC ATP Trr- 

* - fc « „2 £ S » £ S S E S S 

5 280 285 29Q 

= = Z 2 = 2 = - - - - - - - - - 

300 305 

' 0 = £ = 5 S 5 s - - - - - « - = « 

315 320 

, 5 = = 5 = « E - - - - E - - - « 

330 335 

ZU 340 345 

TCACTGGTAG ATGTGCAATA GAAATGGCTA ATTTATTTTC CCAGGAGTGT CCCCAAGTGT 1236 
^ GGAATGGCCG CAGCTCCTTC CCAGTAGCTT ^CCTCTGGC TTGACAAGGT ACAGTGCAGT 1296 
ACAX^CTTC CAGCCGCCCT GCTTCTCTGA CTTGGGAAAG ACAGGCATGG CGGGTAAGGG X3S 6 
CAGCGGTGAG TCGTCCCTCG CTGTTGCTAG AAACGCTGTC TTGTTCTTCA TGGATGGAAG 14l6 
30 ATTTGTTTGA AGGGAGAGGA TGGGAAGGGG TGAAGTCTGC TCATGATGGA TTTGGGGGAT 1476 
ACAGGGAGGA GGATGCCTGC CTTGCAGACG TGGACTTGGC AAAATGTAAC CTTTGCTTTT 1536 
^ GTCTTGCGCC GCTCCCATGG GCTGAGGCAG TGGCTACACA AGAGCTATGC TGCTCTGTGG 1S96 
CCTCCCACAT ATTCATCCCT GTGTTTCAGC TCCTACCTCA CTGTCAGCAC AGCCCTTCAT 1656 
AGCCACGCCC CCTCTTGCTC ACCACAGCCT AGGAGGGGAC CAGAGGGGAC TTCTCTCAGA 1716 
40 GCCCCATGCT CTCTCTCTCA ACCCCATACC AGCCTCTGTG CCAGCGACAG TCCTTCCAAA 1776 
TGGAGGGAGT GAAATCCTTT GGTTTAATTA TTTTCTCCTT CAAGGCACGC CTGCCACTAA 1 836 
^ GGTCAGGCTG ACTTGCATGT CCCTCTAACG TTCGTAGCAG TGTGGTGGAC ACTGTCTTCC 1896 
ACCGACTGCT TCAATACCTC TGAAAGCCAG TGCTCGGAGT GCAGTTCGTG TAAATTAATT 1956 
TGCAGGAAGT ATACTTGGCT AATTGTAGGG CTAGGATTGT GAATGAAATT TGCAAAGTCG 2016 
50 CTTAGCAACA ATGGAAAGCC TTTCTCAGTC ACACCGAGAA GTCACAACCA AGCCAGGTTG 2076 
TGTAGAGTAC AGCTGTGACA TACAGACAGA AGAAGGCTGG GCTGGATGTC AGGCCTCAGA 2136 
^ TGACGGTTTC AGGTGCCAGG AACTATTACC ATTCTGTATC TATCCAGAGT TATTAAAATT 2196 
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GAAAGTTGCA CACATTTGTA TAAGCATGCC TTTCTCCTGA GTTTTAAATT ATATGTATAC 2256 
ACAAACATGT GGCCCTCAAA GATCATGCAC AAACCACTAC TCTTTGCTAA TTCTTGGACT 2316 
5 TTTCTCTTTG ATTTTCAATA AATACAAATC CCCTTCATGC AAAAAAAAAA AAAAAGGGCG 2376 
GCCGC 23Q1 

10 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

20 

Met Gin Arg Leu Gly Gly lie Leu Leu Cys Thr Leu Leu Ala Ala Ala 
1 5 10 15 

Val Pro Thr Ala Pro Ala Pro Ser Pro Thr Val Thr Trp Thr Pro Ala 
25 20 25 30 

Glu Pro Gly Pro Ala Leu Asn Tyr Pro Gin Glu Glu Ala Thr Leu Asn 
35 40 45 

30 Glu Met Phe Arg Glu Val Glu Glu Leu Met Glu Asp Thr Gin His Lys 
50 55 60 

Leu Arg Ser Ala Val Glu Glu Met Glu Ala Glu Glu Ala Ala Ala Lys 
g 5 70 75 80 

35 

Thr Ser Ser Glu Val Asn Leu Ala Ser Leu Pro Pro Asn Tyr His Asn 
85 90 95 

Glu Thr Ser Thr Glu Thr Arg Val Gly Asn Asn Thr Val His Val His 
40 100 105 no 

Gin Glu Val His Lys lie Thr Asn Asn Gin Ser Gly Gin Val Val Phe 
115 120 125 

45 Ser Glu Thr Val He Thr Ser Val Gly Asp Glu Glu Gly Lys Arg Ser 
130 135 140 

His Glu Cys He He Asp Glu Asp Cys Gly Pro Thr Arg Tyr Cys Gin 
5Q 145 150 155 " 160 

Phe Ser Ser Phe Lys Tyr Thr Cys Gin Pro Cys Arg Asp Gin Gin Met 
165 170 175 

Leu Cys Thr Arg Asp Ser Glu Cys Cys Gly Asp Gin Leu Cys Ala Trp 
55 180 185 190 
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«* «. s «. ^ „. ^ Gly Gly asn nr rar ns ^ 

^ " 205 

«p S «- - *, c. «. Wo 01y _ cys ^ ^ ^ Mn ^ 
, 0 S ^ - - •» « <*■ - n. u. jj. Z «. oly Glu teu 

- VP *. £ s „ oln L=u _ ^ ^ na ^ ^ ^ ^ 

15 - - «. s Ma Leu lsp ^ a pro ^ au s=c ^ 2 ^ 

«. 5 „, s „ Kis _ vai ^ hic ^ ^ ™ ^ ^ 

- n ° U 28S 

X - - - «• £ «. «. S „ «. tou ,„ ^ 01u 

300 

^ - „ „. ^ Glu s val oiy ph> ue ^ giu ^ ^ ^ ^ 
- - u. «. „ g s „ leu Ma J net au ph> oiu 

J C 335 

30 p~ «, P „ Glu ser ^ oiy ciy oiu mu ^ 

345 

(2) INFORMATION FOR SEQ ID NO : 1 8 ; 

35 (i) SEQUENCE CHARACTERISTICS- 

Itl LENGTH: 1047 bas « pairs 
. B) TYPE: nucleic acid 

(C) STRANDEDNESS : sino1 e 

(D) TOPOLOGY: linear 

40 (ii> MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
45 {B) LOCATION: 1..1047 

(Xi) SEQUENCE DESCRIPTXON: SEQ ID NO- 18 • 

50 = S 5 - z S 2 = £ s - - - - - « 
= = = = 5 £ 5 = 2 ~ - - . cT « 

30 



96 



WO 98/46755 
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116 
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= = = 5 = 2 2 2 2 = - - ~ - « « 

55 60 

= = = = = = s - - - a s - - - » 

= = = = = s = - - - s « = ^ 

90 95 
S S £ S K £ « « « - - ACA GTC CAT GTG C AC 

100 9 Val Asn Asn Thr Val His val His 

05 110 

2 2 £ £ £ 2 2 z 2 2 - - 2 - - » 
= = S 2 2 2 e - - - - - I « - = 

135 140 

2 2 2 s s 5 2 2 £ « « « -~ - « 

14 S » taiU As P ^ G1 y Pro Thr Arg Tyr Cys Gin 

155 

2 2 Ser £ £ £ £ « £ P CCA TGC « CAG CAG Z 

J s ryr Thr Cys Gin Pro Cys Arg Asp Gin Gin Met 

170 175 

= 2 2 2 5 2 2 - - - - - - - - - 

185 190 

2 2 2 2 2 2 2 2 £ 2 5 = - - - - 
2 S = 5 5 5 2 = 5 2 2 5 = 2 2 - 

215 220 

2 2 2 2 2 2 2 £ 2 2 2 GTC - « - ™ 

225 fj CyS Thr Pro L «« Pro Val Glu Gly Glu Leu 

235 

2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 
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Su J£ qxu Qly i C J f G CGA tgc ccc TGC Geo 

2 Y Ala Leu Asp Arg cys JGC GCC AGT QGC 

265 Y Ala Ser Gly Leu Leu 

CAG CCA CAC *nn 27 ° 
C ^ «n Pro £ «C AGT CTG GTG TAC ATG TGC AAr* 

27s Has Ser Leu ^ G TGC AAG CCA GCC TTC 

280 St <*» Lys Pro Ala Phe 

} GTG GGC AGC CAT rnn 285 

CCG GAT GAG TAC GAA 3 °° 
Pr ° Asp GIu *J GAA GAT GTT GGC TTC ATA «v 

«- JJJ V,, 01y ph . ™ « « „ GAG 

CTG rz* 3l = n Glu 

- °" 5 S S « « « GCC M0 «. 

». ir9 s " - - s G1 " ™ s s s - 

- S S S Glu £ « « « « GAG GAG ATT 
25 34 ° ly JJj G ^ Glu Glu He 1047 
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What is claimed: 

1 . An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which 
is at least 60% homologous to a nucleotide sequence of SEQ ID NO: 1 , SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO: 10, SEQ ID NO: 12, the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98634, the DNA insert of the plasmid deposited with ATCC 
as Accession Number 98633, the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98452, or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 1 000 
nucleotides of a nucleic acid comprising the nucleotide sequence of SEQ ID 
NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:9, SEQ ID NO:10, SEQ ID NO:12, the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98634, the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98633, the DNA insert of the 
plasmid deposited with ATCC as Accession Number 98452, or a complement 
thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising 
an amino acid sequence at least about 60% homologous to the amino acid 
sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, or SEQ ID NO: 1 1 , the 
amino acid sequence encoded by the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98634, the amino acid sequence encoded by the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98633, 
or theamino acid sequence encoded by the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98452; 

d) a nucleic acid molecule which encodes a fragment of a 
polypeptide comprising the amino acid sequence of SEQ ID NO:2, SEQ ID 
NO:5, SEQ ID NO:8, SEQ ID NO:l 1, the polypeptide encoded by the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98634, the 
polypeptide encoded by the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98633, or the polypeptide encoded by the DNA insert of the 
plasmid deposited with ATCC as Accession Number 98452, wherein the 
fragment comprises at least 15 contiguous amino acid residues of the amino acid 
sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 1 1, the 
polypeptide encoded by the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98634, the polypeptide encoded by the DNA insert of the 
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plasmid deposited with ATCC as Accession Number 98611 or , h , 
encoded by the DNA insert of th* m m , ' the P o1 Wide 

Number 98452; and ' P,aSm ' d With ATCC « Accession 

allelic variant of a ^ m ° ,eCU,e Which enc ° d « a naturally occurring 
^^^^^^^^^ 

98634, the polypeptide encoded by the DNA insert of th, „. m / 

ATCC as Accession Number 98633 or th 7 ? ^ With 

10 in^r, r.u , n number y8633, or the polypeptide encoded by the DNA 

•nsert of the plas mi d deposited with ATCC as Accession Number 98452 
wherein the nucleic acid molecule hybridizes to a nucleic acid mole 1 e 
comprising SEQ ID NO:l, SEQ ID NO 3 SEO ID Nn a * '°™° leCU,e 

N 0:7.SEQIDNO:9,SE Q IDL,r SE Q^ 

Plasmid deposited with ATCC as Accession N^^^^ ^ , 

15 .the plasmid deposited with ATCC „ a, ■ T, mSert ° f 

H^ucuwiin At LC as Accession Number 98631 ™-th„r»MA 

the plasmid deposited with ATCC M A,,. ■ m ! ' A 108611 ° f 

thereof; and ^ NUmber 98452 ' or a complement 

1 ATCC 1 A yPCPtld r nCOded ^ ° NA inSCrt of ^ P'asmid deposited 
iWrt f?K , ° n NUmber 98634> thC P0l W tide ^coded by the DNA 

>n e rt of the plasmid deposited with ATCC as Accession Number 98633 oT 

Polypeptide encoded by the DNA insert of fh.r.. * 
Accession Number Q J„ ' ^ ^ depOShed with AT CC as 



Accession Number 98452 
35 

3. 



acid se^ncJ 1 * ""^ "* m ° , °° lh ^ ' **- vecor nucieic 
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4. The nucleic acid molecule of claim 1 further comprising nucleic acid 
sequences encoding a heterologous polypeptide. 

A host cell which contains the nucleic acid molecule of claim 1 . 

The host cell of claim 5 which is a mammalian host cell. 

A non-human mammalian host cell containing the nucleic acid molecule 



An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence 
of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 1 1 , the polypeptide 
encoded by the DNA insert of the plasmid deposited with ATCC as Accession 
Number 98634, the polypeptide encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98633, or the polypeptide encoded 
by the DNA insert of the plasmid deposited with ATCC as Accession Number 
98452, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, the polypeptide 
encoded by the DNA insert of the plasmid deposited with ATCC as Accession 
Number 98634, the polypeptide encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98633, or the polypeptide encoded 
by the DNA insert of the plasmid deposited with ATCC as Accession Number 
98452; 

b) a naturally occurring allelic variant of a polypeptide comprising 
the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ 
ID NO: 1 1, the polypeptide encoded by the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98634, the polypeptide encoded by the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98633, or the 
polypeptide encoded by the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98452, wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule comprising SEQ ID NO:l, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO:10, SEQ ID NO:12, the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98634, the DNA insert of the plasmid deposited 



5. 
6. 
7. 

of claim 1 . 
8. 
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with ATCC as Accession Number 98633, or the DNA insert of the plasmid 
deposed with ATCC as Accession Number 98452 under stringent conditions- 

c) a polypeptide which is encoded by a nucleic acid molecule 
contpnsmg a nucleotide sequence which is at least 60% homologous to a nucleic 
5 and comprising the nucleotide sequence of SEQ ID NO: I, SEQ ID NO-3 SEO 

~e^ 

NO.12, the DNA msert of the plasmid deposited with ATCC as Accession 
Number 98634, the DNA insert of the plasmid deposited with ATCC as 
Accessaon Number 98633, or the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98452; and 

600/ * 2 P ° lypep,ide com P risi "S an amino acid sequence which is at least 
60/o homologous to the amino acid sequence of SEQ ID NO-2 SEO ID NO -5 

the ' ID m ' U ' P ° Iypeptide encoded b y the DNA imert^f ' 

the plasmid deposited with ATCC as Accession Number 98634, the polypeptide 

Number 98633, or the poIypept.de encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98452. 



15 



20 



25 



of SFO in o h t iSO,ated P ° lypeptide 0f claim 8 uprising the amino acid sequence 
en 1 H K u t Q ID NO ' 5, SEQ ID N ° :8 ' ID NO: 1 1, the polypeptide 

98634, the polypeptide encoded by the DNA insert of the plasmid deposited with ATCC 
» Accession Number 98633, or the polypeptide encoded by the DNA insert of 1 
Plasmid deposited with ATCC as Accession Number 98452 



sequences 
11. 



10- The polypeptide of claim 8 further comprising heterologous amino acid 
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of: 



35 



An antibody which selectively binds to a polypeptide of claim 8. 
12. A method for producing a polypeptide selected from the group consisting 
a) a polypeptide comprising the amino acid sequence of SEO ID 

98*t ^ 7" ^ P,aSmid deP ° Sited ^ ATCC - Ac ~-°n Number 
98634, the polypeptide encoded by the DNA insert of the plasmid deposited with 
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ATCC as Accession Number 98633, the polypeptide encoded by the DNA insert 
of the plasmid deposited with ATCC as Accession Number 98452; 

b) a fragment of a polypeptide comprising the amino acid sequence 
of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 1 1, the polypeptide 
encoded by the DNA insert of the plasmid deposited with ATCC as Accession 
Number 98634, the polypeptide encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98633, or the polypeptide encoded 
by the DNA insert of the plasmid deposited with ATCC as Accession Number 
98452, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 1 1 , the polypeptide 
encoded by the DNA insert of the plasmid deposited with ATCC as Accession 
Number 98634, the polypeptide encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98633, or the polypeptide encoded 
by the DNA insert of the plasmid deposited with ATCC as Accession Number 
15 98452; and 

c) a naturally occurring allelic variant of a polypeptide comprising 
the ammo acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ 
ID NO: 1 1 , the polypeptide encoded by the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98634, the polypeptide encoded by the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98633,or the 
polypeptide encoded by the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98452, wherein the polypeptide is encoded by a nucleic acid 
molecule winch hybridizes to a nucleic acid molecule comprising SEQ ID NO l 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID N0 9 
SEQ ID NO:10, SEQ ID NO: 12. the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98634, the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98633, or the nucleotide sequence of the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98452 under 
stringent conditions; 

30 

comprising culturing the host cell of claim 5 under conditions in which the 
nucleic acid molecule is expressed. 

13. A method for detecting the presence of a polypeptide of claim 8 in a 
35 sample comprising: 

a) contacting the sample with a compound which selectively binds to the 
polypeptide; and 
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b) determining whether the compound binds to the polypeptide in the 

sample to thereby detect the presence of a polypeptide of claim 8 in the 
sample. 

14 Themethod of claim 13, wherein the compound which binds to the 
polypeptide is an antibody. 

c . a ; m . 15 '„ • A C ° mpriSing 3 COm P ound which lively binds to a polypeptide of 
claim 8 and instructions for use. 

16 A method for detecting the presence of a nucleic acid molecule in claim 1 
in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectlve, y hybridizes to the nucleic acid molecule; and 
^ b) determining whether the nucleic acid probe or primer binds to a nucleic 

acid molecule in the sample to thereby detect the presence of a nucleic 
acid molecule of claim 1 in the sample. 

17. The method of claim 16, wherein the sample comprises mRNA 
molecules and is contacted with a nucleic acid probe. 

18. A kit comprising a compound which selectively hybridizes to a nucleic 
acid molecule of claim 1 and instructions for use. 

25 19. A method for identifying a compound which binds to a polypeptide of 

claim 8 comprising: y 

a) contacting the polypeptide, or a cell expressing the polypeptide with a 
test compound; and 

b) determining whether the polypeptide binds to the test compound. 

oolvnentl ■ 7^ " ^ 19 ' Wherdnthe bindi "g ^ the test compound to the 
polypept.de is detected by a method selected from the group consisting of 

a) detection of binding by direct detection of test compound/polypeptide 
binding; 

35 b) detection of binding using a competition binding assay; and 

c) detection of binding using an assay for CRSP activity. 
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21. A method of modulating the activity of a polypeptide of claim 8 
compnsing contacting the polypeptide or a cell expressing the polypeptide with a 



A method fo ' identifying a compound which modulates the activity of a 
polypeptide of claim 8 comprising: y 

a) contacting a polypeptide of claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity 
ol the polypeptide. 
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Figure 3 A 

CCX3GACGCGTGGGCGGCACX5GTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTT TCTTTCTCCCTCTT 7 9 

MMALGAAGATRVFVAM 16 

GAGTCCTTCTGAG ATG ATG GCT CTG GGC GCA GCG GGA GCT ACC CGG GTC TTT GTC GCG ATG 140 

VAAALGGHPLLGVSATLNSV 36 

GTA GCG GCG GCT CTC GGC GGC CAC CCT CTG CTG GGA GTG AGC GCC ACC TTG AAC TCG GTT 200 

LNSNAIKNLPPPLGGAAGHP S6 

CTC AAT TCC AAC GCT ATC AAG AAC CTG CCC CCA CCG CTG GGC GGC GCT GCG GGG CAC CCA 260 

GSAV SAAPGILYPGGNKYQT 76 

GGC TCT GCA GTC AGC GCC GCG CCG GGA ATC CTG TAC CCG GGC GGG AAT AAG TAC CAG ACC 320 

* DNYQP YPCAEDEECGTDEY 96 

ATT GAC AAC TAC CAG CCG TAC CCG TGC GCA GAG GAC GAG GAG TGC GGC ACT GAT GAG TAC 3 80 



PTRGGDAGVQICLACRK 
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TGC GCT AGT CCC ACC CGC GGA GGG GAC GCA GGC GTG CAA ATC TGT CTC GCC* TGC AGG AAG 4 4 0 

RRKRCMRHAMCCPGNYCKNG136 
CGC CGA AAA CGC TGC ATG CGT CAC GCT ATG TGC TGC CCC GGG AAT TAC TGC AAA AAT GGA 500 



N H 
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ATA TGC GTG TCT TCT GAT CAA AAT CAT TTC CGA GGA GAA ATT GAG GAA ACC ATC ACT GAA 560 

SFGNDHSTLDGYSRRTTLSS176 
AGC TTT GGT AAT GAT CAT AGC ACC TTG GAT GGG TAT TCC AGA AGA ACC ACC TTG TCT TCA 620 

KMYHTKGQEGSVCLRSSDCA196 
AAA ATG TAT CAC ACC AAA GGA CAA GAA GGT TCT GTT TGT CTC CGG TCA TCA GAC TGT GCC 6 BO 

SGLCCARHFWSKICKPVLKE216 
TCA GGA TTG TGT TGT GCT AGA CAC TTC TGG TCC AAG ATC TGT AAA CCT GTC CTG AAA GAA 74 0 

G QVCTKHRRKGSHGLEIFQR236 
GGT CAA GTG TGT ACC AAG CAT AGG AGA AAA GGC TCT CAT GGA CTA GAA ATA TTC CAG CGT 800 

CYCGEGLSCRIQKDHHQASN256 
TGT TAC TGT GGA GAA GGT CTG TCT TGC CGG ATA CAG AAA GAT CAC CAT CAA GCC AGT AAT 860 

SSRLHTCQRH* 267 
TCT TCT AGG CTT CAC ACT TGT CAG AGA CAC TAA 893 

ACCAGCTATCCAAAATGCAGTGAACTCCTTTTATATAATA^ 972 

CCTAAGG ATAT ACAAGTTCTG TGGTTTCAGTTAAG CATTCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAG CTTTG 1051 

TTTCTTTATGG AACTCCCCTGTG ATTGCAGTAAATT ACTGTATTGTAAATTCTCAGTG TGGCACTTACCTGTAAATG CA 1130 

ATG AAACTTTTAATTATTTTITCT AAAGCTG CTGCACTG CCTATTTTTCCTCTTGTTflT GTAAATTTTTGTACACATTGA 1209 

TTG TTATCTTGACTGACAAATATTCTATATTGAACTG AAGTAAATCATTTCAGCTTATAGTTCTTA^ CAT AACCCT 1288 

TTACCCCATTTNATTCTAGAGTCNAGAACGCAAGGATCTCTTGGAATGACA 1367 
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Figure 5 



M 



1 



CTCG AGG CCAAAATTCGG CACG AGG CCCGG CTGTGGTCTAGCATAAAGGCGG AGCCCAGAAGAAGGCG CGGGGT ATG 77 

GEASPPAPARRHLLVLLLLL 21 

GGA GAA GCC TCC CCA CCT GCC CCC GCA AGG CGG CAT CTG CTG GTC CTG CTG CTG CTC CTC 137 

STLVI PSAAAPIHDADAQES 41 

TCT ACC CTG GTG ATC CCC TCC GCT GCA GCT CCT ATC CAT GAT GCT GAC GCC CAA GAG AGC 197 

S LGLTGLQS LLQGFSRLFLK 61 

TCC TTG GGT CTC ACA GGC CTC CAG AGC CTA CTC CAA GGC TTC AGC CGA CTT TTC CTG AAA 2S7 

GNLLRG IDSLFSAPMDFRGL 81 

GGT AAC CTG CTT CGG GGC ATA GAC AGC TTA TTC TCT GCC CCC ATG GAC TTC CGG GGC CTC 317 

PGNYHKEENQEHQLGNNTLS 101 

CCT GGG AAC TAC CAC AAA GAG GAG AAC CAG GAG CAC CAG CTG GGG AAC AAC ACC CTC TCC 377 

S HLQI DKMTDNKTGEVL I S E 121 

AGC CAC CTC CAG ATC GAC AAG ATG ACC GAC AAC AAG ACA GGA GAG GTG CTG ATC TCC GAG 437 

NVVASIQPAEGSFEGDLKVP 141 

AAT GTG GTG GCA TCC ATT CAA CCA GCG GAG GGG AGC TTC GAG GGT GAT TTG AAG GTA CCC 4 97 

RMEEKEALVP IQKATDS FHT 161 

AGG ATG GAG GAG AAG GAG GCC CTG GTA CCC ATC CAG AAG GCC ACG GAC AGC TTC CAC ACA 557 

ELHPRVAFWI IKLPRRRSHQ 181 

GAA CTC CAT CCC CGG GTG GCC TTC TGG ATC ATT AAG CTG CCA CGG CGG AGG TCC CAC CAG 617 

DAL.EGGHWLSEKRHRLQAI R201 

GAT GCC CTG GAG GGC GGC CAC TGG CTC AGC GAG AAG CGA CAC CGC CTG CAG GCC ATC CGG 677 

DGLRKGTHKDVLEEGTESSS 221 

GAT GGA CTC CGC AAG GGG ACC CAC AAG GAC GTC CTA GAA GAG GGG ACC GAG AGC TCC TCC 737 

H SRLSPRKTHLLYI LRPSRQ241 

CAC TCC AGG CTG TCC CCC CGA AAG ACC CAC TTA CTG TAC ATC CTC AGG CCC TCT CGG CAG 7 97 

L * 243 

CTG TAG B03 

GGGTGGGGACCGGGG AG CAC CTG CCTGTAG CCCCCATCAG ACCCTG CCCCAAGCACCATATGG AAAT AAAGTTCTTTCT 882 

TACATCTAAAAAAAAAAAAAAAAAAAAAAAAAAAATTGGCGGCCGC 928 
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FIGURE 6 
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Figure 8A 

FGTCGACCCACGCGTCCGCTGTGGCAGCCCAGCTACCGGTCGTGACCACATCCAGCTTGCAGCTCAGCTTTGTTCATTC 79 

MQRLGGILLCTL 12 

G AATTGGGCGG CGG C CAG CG CGGAACAAAC ATG CAG CGG CTC GGG GGT ATT TTG CTG TGT ACA CTG 145 

LAAAVPTAPAPSPTVTWTPA 32 

CTG GCG GCG GCG GTC CCC ACT GCT CCT GCT CCT TCC CCG ACG GTC ACT TGG ACT CCG GCG 205 

EPGPALNYPQEEATLNEMFR 52 

GAG CCG GGC CCA GCT CTC AAC TAC CCT CAG GAG GAA GCT ACG CTC AAT GAG ATG TTT CGA .26 5 

EVEELMEDTQHKLRSAVEEM 72 

GAG GTG GAG GAG CTG ATG GAA GAC ACT CAG CAC AAA CTG CGC AGT GCC GTG GAG GAG ATG 325 

EAEEAAAKTSSEVNLASLPP 92 

GAG GCG GAA GAA CCA GCT GCT AAA ACG TCC TCT GAG GTG AAC CTG GCA AGC TTA CCT CCC 3 85 

NYH.NETSTETRVGNNTVHVH 112 

AAC TAT CAC AAT GAG ACC AGC ACG GAG ACC AGG GTG GGA AAT AAC ACA GTC CAT GTG CAC 44S 

QEVHK XTNNQSGQVV'FSETV 132 

. CAG GAA GTT CAC AAG ATA ACC AAC AAC CAG AGT GGA CAG GTG GTC TTT TCT GAG ACA GTC SOS 

ITSVGDEEGKRSHECIIDED 152 

ATT ACA TCT GTA GGG GAT GAA GAA GGC AAG AGG AGC CAT GAA TGT ATC ATT GAT GAA GAC 565 

CGPTRYCQFSSFKYTCQ PCR 172 

TGT GGG CCC ACC AGG TAC TGC CAG TTC TCC AGC TTC AAG TAC ACC TGC CAG CCA TGC CGG 625 

DQQMLCTRDSECCGDQLCAW 192 

GAC CAG CAG ATG CTA TGC ACC CGA GAC AGT GAG TGC TGT GGA GAC CAG CTG TGT GCC TGG 6 65 

ghct'qkatkggngticdnqr 212 

GGT CAC TGC ACC CAA AAG GCC ACC AAA GGT GGC AAT GGG ACC ATC TGT GAC AAC CAG AGG 745 

DCQPGLCCAFQRGLLFPVCT 232 

GAT TGC CAG CCT GGC CTG TGT TGT GCC TTC CAA AGA GGC CTG CTG TTC CCC GTG TGC ACA 80S 

P LPVEG E L CHD PTSQLLDL I 252 

CCC CTG CCC GTG GAG GGA GAG CTC TGC CAT GAC CCC ACC AGC CAG CTG CTG GAT CTC ATC 865 

TWELEPEGALDRCPCASGLL 272 

ACC TGG GAA CTG GAG CCT GAA GGA GCT TTG GAC CGA TGC CCC TGC GCC AGT GGC CTC CTA 925 

CQPHSHS LVYMCKPAFVGSH 292 

TGC CAG CCA CAC AGC CAC AGT CTG GTG TAC ATG TGC AAG CCA GCC TTC GTG GGC AGC CAT 985 

DHSEESQ'PREAPDEYEDVG 312 

GAC CAC AGT GAG GAG AGC CAG CTG CCC AGG GAG GCC CCG GAT GAG TAC GAA GAT GTT GGC 1045 



FIGEVRQELEDLERSLAQEM 
TTC ATA GGG GAA GTG CGC CAG GAG CTG GAA GAC CTG GAG CGG AGC CTA GCC CAG GAG ATG 
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